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Method and system for providing alternatives tor text derived from stochastic Input sources 



(57) A computer-implemented method for providing 
a candidate list of alternatives for a text selection con- 
taining text from multiple input sources, each of which 
can be stochastic (such as a speech recognition unit, 
handwriting recognition unit, or input method editor) or 
non-stochastic (such as a keyboard and mouse). A text 
component of the text selection may be the result of data 
processed through a series of stochastic input sources, 
such as speech input that is converted to text by a 
speech recognition unit before being used as Input Into 
an Input method editor. To detemriine alternatives for the 
text selection, a stochastic input combiner parses the 
text selection into text components from different Input 



sources. For each stochastic text component, the com- 
biner retrieves a stochastic model containing altema- 
tives for the text component, if the stochastic text com- 
ponent is the result of a series of stochastic Input sourc- 
es, the combiner derives a stochastic model that accu- 
rately reflects the probabilities of the results of the entire 
series. The combiner creates a list of alternatives for the 
text selection by combining the stochastic models re- 
trieved. The combiner may revise the list of alternatives 
by applying natural language principles to the text se- 
lection as a whole. The list of alternatives for the text 
selection Is then presented to the user. If the user choos- 
es one of the alternatives, then the word processor re- 
places the text selection with the chosen candidate. 
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tl*hX1l i'^nmih 9- HTD-fe <yt{CtJ ((^THitt 
I.. «S:coA^jjSt(±. ^N^iiaiA:»}jlSfcJ:U''ttilA:'3jS 
Sr^tJ. ;K-H*5J:l/-7'>x{cJ:Sr^xhA;'3 
14. ^HtfflA^J^SK^fi^TAS. ^^*ftSIA:/jaS<^J^. 3 
y fa - <4 J.— !f *<S0 tJt T='f X h i&m(C«^T 

--H. mmx-nwii. x-))imm^>.zm?i-t 
i'^(r>wmmimi(Dmmi:^L.^'m<r>im 
^mmiioQ%i:xfThh. imx-num\i. # 
nmmx'hh. ^Pt^mi. ^pxfii^^x 
himoxhiizmstti. 3>'tfjL-^'*<#^5-^(c 
mtti z. b ^i^mt^:^}^(^>x'$> h . %mxi^wrM<r> 

#S#^i^afcJ:tXA:fti^x-rA ( IME) r 



[00 23]a— f{4. V-HrP-b-y-y-Sri^oTr^ 

:^'7x-x$•:f^LT-e^^<oiSffli^•«5K-r2.. i^c7)io 
[ 0 0 2 4 ] -7- h-ro-fe v-9-*itiett ^.iRfflyx 
\i. ifieati?-f^xh«t><oiiss-s35'f^. •rsi:'?- 

[0025] lta!IA:'3te-^<4. rn/^A^ 

$rie-&L. MiRT^xi^-kmzm-mmv xhif^^ 

[0026] «a!IA:']fe-^lU4, v-Hro-tr y-trr 

[ 0 0 2 7 ] 7- Hrn-b-y-tu. as?7^#;:^ hmm 

wm^TMi, ^T^xhrnmizsmmmsimm-r 
hzbizji*). mmvxhj/^mizmi-imm'f:rfi<7) 

[0028] h^mu. m^itzmixiimiTt 

dz-th Zbhhh. ':}f^j:<bLl ocoitilA^Jiil*>*ta!l 
fe«S:^L. ^tii:<X(mmXijm.^coX:fJbti>. 

mcommx:hwuT^xhmmm(r>mm^:im-i>. m 
mA:ht&-^mi. T^xhmm^zMm•ti>ml^:is^t^ 
mmm^trii^^M^^i-h, zcr>m^izi5\,^x<x<r)ms 
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(Dl^^cr,Mm$:J.-^l,z^ib^j:\,\ 

[0029] itsjA:'3te^S5{4. 7-^x hwrnm^m. 

:3.-fX:hmmi. -eoi— fA^Srag^Ly^JtaJA 

mm 1 mxm^mi . mzm^x-n^-^mi, 
m 1 tmxijm^ Uz^mii:m2imxm<r>x 
:hb-rh. mmmxmti. ^m-yfcmm^)^^*^ 
(,mm^^m,th. m2mixmt'(><7)mm^ 
li. m-^mwjxhiztbi^i'jitmmh. ^(>izm 
mxmtf^^^m-tm. mmxm-^m. m^^m 
*Jxh<7)^mi:fli3imx:hm^(0X:hbti>. im 
i^imxMzmimk<r>mmxijm>(>i^A-ri 
i^-kmsvxhii. 'Mum^T/ux'hi. 

[0030] JtaiA?)jgi<0x5-{4. mi<r>mizhrzh 

zbtfi^^\ ^--i'lmimmizm^x . -!-<ox9-<r) 

il (recognize. l^ay-t^X) j bnMLfz 

1^. ^pmm^yi^yii(mmib ix ^mmmm 

(wreck a n i c e . U -y^Ti-^ X) j b^ 

m< ^mmiii}^io^¥]mThizbizm\>^x\.^ 

[0031] ^iEm:mmi:'^iib-riza.-^'i3-^'^x 

himUl. '7-\(ra-t'y'^tlt-^<mmr-^Xh<7)^ 

i^\'^xmmuxh^f^^-tix.i£. ic^'jx hiiJL-i' 
a^MmuzT^x hizMm-tmmt: i otjt^c^i^t- 

[0032] Z<7)j: 0 ^j:TM^i:mM-t&fzib. imx 

iJm^^i. mRT^xhimjEmw=f:f>\^^i,, m 

h. mi ffgiij (nmxu, m]Emcr)mwj:s^&T^x 

fzX:h:fimizm-th=erf\^bi)i^tim^i:^mti>. X 

m^xm^^^r)v. ^ ■ $m^^mm=tT 

[0033] m^sm^trMi. ^■sm&^m&t^^ 
bmmthb. lojjuKo^e-f^xhs-i^^s. i 
(^'^Lfzm^T^xvi^znLx . w^x-nm^mm 
tmm.izh^^x'mi^-fh. mmm^rMi. z 

[0034] mmi%m.Ltcti^t=^%^m%im^mm 

[003 51 illf^il^J 



0 1 S:#BSL^*^%*^HBco||ife{cjaL/c3y ta-^J' 

ooji. «!3l«o>'^•-v-^;^3>'^^A-:^'xxf Ai 20 

SB 1 2 1 1 . i^XrA;<^ U 1 2 2 i: , xXf Aa'x 
1 2 3 i: Sr-ttf . x;^rA>'N*;^ 1 2 314, v;^xA;><t 
U 1 2 2 SrJoai^S 1 2 1 izmm-fh . ^Xr :e U 
1 2 2{i;. 'J-h'j}->"J;<^U (ROM) I24tyy 
yisT^^X:*.^^) (RAM) 12 5i:5-S-0. ROM 
1 24(4S*Aai:')j^;^rAl 26 (B lOS) Sr^ 
c:coB lOSJiS^/l^-f-ySr-^tf. C:c7)S*;P- 

[0 0 363 >'N-y-^;U3ytA-:J'i^;:^TAl 2 0 

x-f:^^l 2 9$:a!;^^»S-ri./^*bOiamr-f Yy 
^ri28t. CD-ROM^(7)3te«f*:t-J>S]RaiU^ 

flfceT^;^^' 1 3 1 ?:e!;^iiS|i-ra/srt(?53e-f-f H 
3 0fc$-$A>{c^t;. ^^-Yr-fXi^Vy^y 
127 tmS^T^ Xif]iy>(yi 28 t^T'fX^V^ 

-ens Ota. ^'if^-YTtx^Y^^f^y^y 

s.~xi3 2hm.%'f4X9 Yy^r>i>i?yx-Xl 
33ii^Yy^y^>9y 3^~X\3Ati:ifLX. \y 

x^Axx 1 2 3 s . c<x/i> H 7 -f ytj i If 

SSr>'N*-y:>-yp3 y tfa-:? j^Xt'A 1 2 0(c«ltt-r 
S. CiCitS^Ucnyt^a-^SBRO^iaiftcJi, a- 
Kr^fX^. SXajL^fiiaE^r-fX;?, tiJ:t^CD-R 

[0037] JL— f {4, ^---K- H 1 4 O^V'^;^ 1 4 

>'^•-y:^/^3yta-^'i^XxAl 2 0t3vyH-«f>flf 
%^X:h-th. •7^^'D7*y 1 6 Hi, 3>'hrj.-:^' 
v-X-f A 1 2 0 C^g^cO^^A:*! S-ff a ^i6{Cffiffl-r 
h, i-lf{4. 3yea-^'>';^-fAfc0J^^§(O^ 
77 •y^ttffi$-A:>3tl.J*^, Xff^yX^m^^Xm 
le^'rU-y M 6 2t;/57^ >y^'jt^^fli<. 3yt 
a-rJ'v'XxAl 2 0{4, fS<7)A:^SIS (0^H^f ) fc 
LT>''3-f;<.T-f y;?, y-A/N*.y mKryy-t-. 

x^^-r^ ^"/f-x^v-y. xi> ^ y xm^'^A.X'h 
■7^^'P7:ryi 6 m. xXrAyUicgjiggt 
i^^^-r -f itrfr9 1 6 0 S:^ LTSft^a 1 2 1 fc 
^«?T'#I.. flfi<OA:'5Jia{4, i^XTJ^K-X^zW^l^tz 
U r/P--K- l>^>'^'7i-X146 Sr LTjaa^a 
1 2 1 tcgfig^-f S%&3i«^»r^. L*>L=5r3&«'5y-Ad^- 
h , JL-yN'— V 'J r/P-'N*;^ (USB) ^O'f y^y 
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X J: -oT^i^-ri) c: t ■& . 
[0038] ^-9 1 4 7^c0^^g{4, hrxTTr 

1 4 8^<D^ y^y j^-x^ifLxi^xi-A)^x i 
^i^xrMt, x\i-i]^r^}y9m<r>mm-n^m. 
[ 0 0 3 9 3 >''?-y■^;^3>'tA-^'vXTA 1 2 0 

4 9{C|&a«^LTi,m\ iiSlayti- 

9 1 4 9(4, if-A, tr^a, *a4^>y h-? 

-:7y-H^'*0, -«tB!neA--y:>-;payti- 
;7vX7^A 1 2 0^C|l3atTlf^B3L^cgSO^<^,&V^ 
14^-C Sr^tf . tztcLm 1 tilEligg 1 5 0 
•t. 0Hw^^-ffftaS«?J4, D-;!?/l^x«Jr^-y hV- 
(LAN) 1 5 1i:, '7^KxUT*-yh'7-^' (W 

AN) I5 2fc^^tf. ::oj:^^;r-yh7-^'S*i 
Vm. ^tiy\^a.-9^ vY^-9 . ^yhy 

[0040] LAmmx'mi-i,m^. j^^-v-tiua 

yt^A-^'i/XrAi 2 0{4, ^ yh^—i^^y^yx. 

53S::r^LT^-X?;^;^.y hv-^ 1 5 
■tS. WANIg^-Cffiffl-?-*^. A-y-^;^3ytJL 
-:?>-;j^tA1 2 0{4, — ^t^-fAl 5 4^cO#g$: 

^y^'-^-y KX'Jr^.y fr;-^! 

5 2^:frLTji^Srll4-tS. trAl 5 4J4rt]«ll/i 
<4nftftT-* 0 , ^ 'J r/t'-l^- h>fy^'7x-X146 
Sr:rt-LT>-;^xAyN';^ 1 2 3tcjii^f ^-y h^-^' 
iS^<o%^, >''?-y■^/^3yeJ.-:?xxrAl 2 0<o 

Tt,m\ lljK<7)*-yh7-^'««|{4WT'S)0, 3>t 

A-^'>-x-fAra<7)iifi'j y9\tm^WLX'miLLX{> 

^v>. i3t*||^{4, >"f-y:^/^^veJL-:J'i'X■fA 

liWfc. t^xv^td,i.^-j'^oy\;3.-9i^x'fJ^X'{j 

^^IgT'**). CD-ROMJili'hcO^a, e?«i{4';T-y 
h'7-:?lgg|^y^7x-;?.l 5 3^Ci:-5■C;^X hay 

[0 04 1 3 ^<c7)rD^^5A^>''i-/l-$:3yti- 
9i^XT-A 1 2 0<J5|gij)J|aJ^R AM 12 5 iCtSttT'l: 
7*n^^7Ati<''a-/W4. =/yeA-^'i^>^-fAl 

2o«igti5-$ijfflit. i-if, Aa!:»j^g, hh\.^\m 
/I'— ?-y, :t'^i^-r -f y^'^xf-A 1 3 5 , rry^ 

-i^3y7'o^'9At>''jL-/H 38, r-^fl}ji, 7" 

7'>if. ffi<^y7h'>xrgBa, J)I>v^{47t-A'!7x 
rsPiftS-^tf. *^Bji4. }iS!lA:'3lg-^rni5^7Ati-' 

a-;H3 7, «ia!lA:^'^y^7x-xrn:/7Ati/' 
j.-;H 3 9^fi01oULh<^ro^^7A^:i/jt-;HcJ; 
->f»®tllSfr-|iS. ^ix^>^:xi-/l/0#'«rl4, 16 

[0042] Tr'j^-v3yrn^7Ati''a-;n 



3 8ii, mmizmmixmm'tm^^j:Tr^jy~=y 

-^^:i^*i-/l'24 0. i3j:tJfA:^3J^X-f-A ( IME) 2 
[0043] OTt^SJBB't&^^^^WiSrllff-rs^ca^) 

[0044] ^mmizm^if>^j:X 3 fc. *^^{±fl!!<0 

-^^W^rn-b y-^-i^XTi., V'f^^arn-fe 

Tti. )mi*-yhV-^'r'Jy^'Uv:#jaiS«W||ISt 

[0045] rP^^5 Ati/'a-;KOi(S 
H2«. aaA:'JjgiT-:J'Jftai^>^TA2 0 0oru/ 

A-/P-2 3 0. it^i2gSrD^7ixt>''i-;l^24 0. 

( I ME) 2 5 0. «^B|j|rn^5A* 
>'*A-/P2 6 0T'S>I., ^-^-H14 0J4, ft^W^ 

LT "7- Hro-fe •y^f 2 1 0 fcT=¥;^ h $-A^J 

^mi>)xhimSX'^i>. Zco-M^'r^Xhii, 
«otflsgA^JjlSfci:i;f|f JlfflA^]®&»«^>coA:^i&^tf t 
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iit. MA.tsmx'tfi^mtRT^xhim'th, tm 

A:^ji2 3 0, 24 0, 2 50. 2 6 0<7)iSf^Sr)l<XiJi 

[0046] ¥S#l?ii7-p?^7At>''a-;U2 3 0 
ii. i-i»'*»'^O#^#A^j2 8 0$-S«tKI.. J--if 
{4, x^>f 7;^S:fflv^T»ie^'7"l/--y h 1 6 2izmit 
hZklzX*). #S#A^J2 8 0S:^T'^S. flfe<7)Sg 

a^rfflv^Tt^v^. mx.ii-7-^X14 2i:m^'^X. 
:5'14 7(C»V^Ttm>r^, X^'^f 7XS:fflV^r. :J'-yf- 
X^V~yizm<ZthX'^^, ^#A;'3 2 8 0«4, 

if * t <*i7f'<^—f i^^^i^xf-A 1 3 5o«ie^r 
w-y h H7-f >\'^yjL-;K:J: ^#l2mro/9 
A^j/i-yW 2 3 0^^)111.. 

[0047] 3yea-:?T'¥S*?r)»S?-rSC:t<4ai 
Li-^ ^#i2MSra^^7A^y'a-;l'2 3 0{4. 
^A:'32 8 0$r^{;:J»f?'e§l.t{4|5|4>^V\ roiT 
7Atya-;W2 3 0*^ffo«^oc:i:{4. ###A:^J 
2 8 0(?)1gfl|^4JSL. #1^tiEStt<Oi|*^SilO^ 

a{4, T-:J'fl!jgSrirtf, dcOr-^'^SiSJi. 
mroy^J>,=ii;a.-)U2 3 Oi}<^t^X^ 2 8 0 Sr^a 

mix^Lfzm^T—i'i'ttt, 

[0048] r;i^2 7 oijmmf-f' i^m-^ 

mvxhX'hh, t&^<4, 'm^iz^<^(,tifzmm. 
x'htt:>fy. i^mumtmcth. ^mzmmht. 

m-\i. «)liA^)iS*>'^UciSS;t(4'B]S:y-HlctS 

mh, ^mttz\t^\ijm'f~9X'hhtzih^ ^y- 
mnm^tmtfzii'^izpi'o'^xtzmmi^tmti,, 
mmz'j^m(7):mim\>^xm'immt. ^^r^xh 

?>mm(o^^^m&Lx^i^W!'ii¥^x'^h, zco^ 
^mmmtiztizxo. mmhT-^xhmmco 
mmi^x'^i. 

[0049] l|*f-:?J4. <X h nmk ^tlhizm 

^'^Xt:mmbmxhX'hM:\,'>. S)imtfzii'^l,zn 

m^hi^tzthX'^t. 

[ 0 0 5 0 ] §^i2il7'0/7Ati/A-;U2 4 Oti, 
¥S^I2iJrn^5At 2 3 0 1 TOctW^^ 

S. /iJt'LW2g|rp^'7Ati;jL-;P2 4 0{4. V 
>f ^J'o^ * > 1 6 1 %:'ftLXJL—fi>^ti>'SPXiJ 2 9 0 
IrSttaXS. '7-^^'07=r>l 6 ^'^W— r-f >- 
/vX-fA 1 3 5c0-7'f ^' Q7 * > H ^'f A't 

^)\^t/^n^j:UWrz< ^A.hhi}->hX-hh. fi!-?T^^ 
i2^o^^5Atxa-;U24 0t, *til^«$-^-r 

h, mm=tr!V27 0h\i^ r—^mmmith. Z 
<^T-:?«Ji{4. W2itro^5Atv'jL-yP24 0 
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2 9 0 ^:9mlx^lfcm^T-^'it 

tt. 

[0051 ] X-hl^Xfh, (IME) 2 50t. i|*f 
-^Sr^-rS. I ME 2 5 0{i. A^lSrJ'HIiST^;^ 
ht^m-fl. . I ME 2 5 0^c7)A^)«i. mUf^-if^ 
-HI 4 0i3j:l^-7'>Xl 4 25::ft-lT3yea.-;J'A. 
9ArK1]Lfi.^^7.VXhh, i|**f;l^2 70c 
T-^'tiit^r^tf. 3<7)T-:J'fflJtll, I ME 2 5 

[0052] IME25 Oti, rvTWlS^tCfeViT^ 

^^t^t^X'hh, I ME 2 5 0 S:i$i5^jm»?. ^'^<r>m. 

1 ME 2 5 OOiS-^, J--1^«iHlfao4>llli3S:^iOt^ 

^r«.«t'B^:*:*<?3loS:«0L-CV^J:a. -eciT'lME 

2 5 0 . «-aOfemS-4«-t htmz. f>{ rLfz^P 

[0053] 1 -:><n%mK-hmti-»E^'thm&&mi. 

tyjL-;KOfi|fig2 9 3(CS^-fi:fcOT'*&. CKOffllg 
2 9 3{i. IME2 5O<Oflfi<Ol^0l!-Ct»*>5. 
[0054] ^cOHSfiMtCfct^T . ^iS^^A:»J 262 
$-ayeA-:5'(::A:>3L. -?-ix5:i$-D-cS*lix^>^h 
#^2 6 2{i, §^l2MirD/5Atv 
a-;P2 6 0'\MS. mm:Tuy'yJ^^Va.-tV2 
6 OfiOijf^Ji. -t^fg^rn^yA^ X A-;U2 4 0<0 
Wf^fcdTOCT'*!), t>''a-;l'2 6 0$'aiLT7i^ 

'^T-J)S. fi?iJ;ttf#^l2irrn^^^A*i/jL-yl'2 6 0 

iSSrBIS?T'# S. i|$^T;^2 7 0 d{i. r-^'^ii^ 

;w2 6 o*<#^A^jS:«yifc4«Uca[*T-^«-ir 
tt. 

[0055] mimP ■ B*H I MEC0%&, ^^fgiS 

rn^5A^>''a.-/U2 6 0Ji, ^i§#?^?!)»<i>||i§r=*r 
xM^fflSri^L. -?-nS:?l*tr/U2 7 0d{cfSlftT' 
# I. . Ji^t f ;P 2 7 0 d izmfi Ifz 1 oja±<7)^igT 
^;^M^«i. I ME2 5 0'^<OA:^)t^rl). IME2 

h. I ME 2 5 o^(D^mx.:hii. mmmmm 

^ti. titiL I ME 2 5 0A^2o<Dfflgi|A:fjta 



[00 56] 02^C^3^^T, #^f2SJrn/5Atx-i 
-;l^2 6 0 I ME 2 5 0^<7)3i^Ep{i, ■g^fSiSTn 

>''jL-;p2 6 o*n ME 2 5 o<r>mmx:hmx' 

hlZti7F:lX\'>l. U»LC:<t<i>20c^ro/5A 
ti^'i-;Ki, mi^ y^y x.-xLX\^hhi,'rX'\itc 
H^i2«SrD^'7-iU^ >''a-;P2 6 0A>4> i me 2 
5 0A.(^)jtsiA:»3J4. mt\£m^X^^y9yx-xi 

^<^m^XMy'^yx-xi3 9i,znLX.^m 
mx^mtmm^Lx\^i,, 

T7-'j^r-x3y2 1 ot<D^x\ m&r—S> 

i^iimt. :iz.x'mmx-))m^i. mm^mttmr 
D^7At>''A-yi'2 3 0. "^PWMruy^A^v^ 

-1124 0. IME2 5 0-C*S. jtaiT-:}'ii»tL 
TJtaSA:?]-^>^'7x-xi 3 9S:ia{tt, mir-i' 

{±> itasA^j'f y^'7x-xi:coa^l^rffi$•^^fi(tT- 
^<s m&x-)]W\znLx<nMm-nm^wz^j:^^x'h 

v\ JtSlr-^'SrgftlXl.rry^— ^3^2 1 0{4, 

7"i;y-v-3y2 1 o«4. ;^TW y Ki^-h. 

■f. «i^^-;w7-Dj!^7A, ^mw^r^yyA. ca 

yvymv-Y=tyYff)'7A9uvy\' ^cnm^nXh 5 >» n° 
[00 58] ^-Yrx3-^y^%tm<n^, 7-Hr 

D-t-yt2 1 0{4, mmX-n^y9y:L-Xl39iit 

Lx. wmm&T^xvii^^mh. ^mMm&\i. v 

-Hru-fe •y-!ft=T-:^^A:>jL7t«#jfflA:»3aSi*»f>cO 
t<OT'J)S. ^a«0«a!IA:^JJi*><oV-HTn-b-y-t2 
1 0{CT-:J'5:3iil,::fc{cjp;c, mxM^—if--Y\A 

■b y-9-fcA:*T^S. '7-Hrn-fe>yHf2 1 Oti, Z.h. 
hTtT—9-^^^^\.XW^X-my-^XY^'{f.^\^^ ^ 
il*i-1f{cjgfl^-t5. '7-H7'n-b.yt2 1 0J4. -5- 
<OTf^4f:^H<0#S3g<OA:'j^SrJt.— ftc^S'SrvW, IB 

[0059] y-Hrn-fe y-9-2 1 0J4. a.-Hf*>'r^ 

^x-y)m.^Tt\,zLx\.^m.. mwm-^h, v-h 

7'D-b.y-9-2 1 0J4, a!RT^Xhg«ci:-eiOSlRT^ 
X H<^S«WA:'3aii: S:«a!lA:^>f V:? 7 x-X 1 3 
9^iS0, fi^SUXhSrS^-ri.. Jta!lA:^^y:J'7x 
-X139«i. -etOS^SrSaSL, ^3ltRr=^Xh{c*t 
t'*l^'JXh?:7-H7*n-fe y-9-2 1 OtCjg-r. 7- 
H7*D-fe.yH^{i, yyy ^i)iV:3.-^^y9y 3.-X^ 



2 7 0 {^5gts^^^s^f ii^ei^ i.moxh i^git 

[006 1] ftaSA:^*!-^^! 3 7J4, m&UXV^^ 
^^^137{i:. fil*^T;U2 7 0«>/^«^L,/cffl 

m&^jxv^^m-fh. z.ti^zm-i^^?mm'^T>v2 

[00 6 2] 02{CfcV^r. fiia!IA:'3^2 3 0. 24 

0, 2 5o\i.. ^n<rsimT~9im>imx-nniit 

Lxm.JiWhZt^j:<. '7-Krn-fe-ytf2 1 OttlW 
•t^rt)^9ftfflA:»3iS2 3 0, 240. 250 

#^3Wtg^ (/::?t'LMA:t)^y^'7x-xi 3 9 

$-:)>LT) iiaiT-^'^'7-Krn-fe7l>-2 1 O^^h 

Zbii^X'^h. #A:»3«*»<.«oiS)g5T-^(4. '7-h-7- 

^A:>]jS{4. M5i|ttffllA^JjB2 9 6TA'). MyOfllfiRi: 
[0063] :itltX'^rxiyyl.^V:i.-Mm'i^Z 

mmtxttztii. '^f^t,z\mhi)^^£Xoi,z. ztit^r 

m^i:m:LbhX'^h. mi.zmAxm^^ 131 h 
nmm:^)\'2 20t\i. miX:h^ y^yx-xi 

3 9I.Z^>hhZtti^X'^h. Ztl(y3'0<7)ruf^yM.^ 

'JjL—Mi. ^^v-T-tyyi^XTM, i 3 5ttz\i^ 
-Yyn^y-^2l0ffy-UiL-fhZ.thX'^h. 
SI 3 7*5j;y^a^mgtT;P2 2 0{i, #ftiL:tr 
n^^AtL, #'!?**V-KrD-fey9-2 1 Otil^-f 
y:5'7x-;^LTtm\ *^cJta!IA:'3j!S2 3 0. 24 
0. 2 50, 2 6 0{i, ^-irjjtatirry^r-i/gyr 

{oo6AM^&imm<r)mm3. 5, im, * 

■ttMz. mmx-nm^m 3 7 tnmm=t'ffv 2 2 
otoiaigsr^-r. 

10 0 6 53 HStfcV^T, 2 014. « 

aoOxdfXhAifj^SSrWt. r*>^><?)A:^j^gJt 
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SXO. -5-ixS:7-h*7*n-b.y-9-^(7)Trilir-v3y2 
ICVjIil.. a>'tA-:?{4, ^£;0A:']^f-^Xh3 0 
OtC^L, =^-9 1 4 7{C«^-r&. ^WttJV^T. 
a-if{4 TTHIS IS A MESSAGE WRITTEN BY A THOUSAND 
MONKEYS TYPING AT RANDOM. {Z.fO,i.'^W(^mK' 
fz(>i!bizif^rLxm\>^tz;<-y^-i^Xt. ) j t£Or 

TTHIS IS A MESSAGE WRITTEN BY A TOWN OF MY KEYS T 
AIPING AT RANDOM. (ZtHiXfzf^mzfL.cmcOi' 
fy«7)fBr*<»V^/^^ v-b-y-Cf . ) J tfmi. T 
^:^h3 0 0S:4«L/;. 

[0066] ^^X h 3 0 0*^^^^$ni, . f {4 
-e<^-aS:«IB^-tS>Ih{cJ: Stl?x^xh3 1 
OSrf^^-t*. 03c^a*^T^Xb3 1 0J4. 3-:><0f 
^Xh®^3 12. 3 14. 3 16Sr^1-S. Z.t\.tyT- 

^xYmxt. ^'^ifW£h%mx-mfi>tom.'^iix 

\^h. 'S\xM^^XYm%3 1 2{4, ^%Vi^uy 
7A€ >?a-;|.2 3 0/)t#S#A:lj 2 8 0 1 JSaLT* 

^Lj'cf^cTjioTS)!.. z.ffmmK-hm-^h^m^fz 

m^\t. 2 7 0a l> ,'§m^}xy3 

1 8J4. i|$tT;W2 7 0 a^iimm-h=f^X Y^%3 

\2i?iimi')xvxhh. ^<rmw)xYi}>h^v\::L 
-m^ rTowN IF (am) J iiM/j-i, -?-oai<4. 

T=^Xhg^3 1 2<07ct^o;^c¥^§A:^2 8 0tC*t 

i^'J;^h 3 1 805 TTOWN OF «rm) J *«*a"C 
J>l.i:3yifi-:5' ^JflB} Lfc*' f>-CJ) S . L*- L^*< 
^><^'Jxh3 18*%-rJ:o{c, 3yta-^{4 rco 
(iNOF(<^;y«>y) J iJj:t^ rTH0USAND(^E(:O) J tr^^ 
hW«3 1 20(^i: LTligiSLTV^&. 
[ 0 0 6 7 ] #^|gi|rn^7A^ ->'a-;l.2 4 0{4. 
W^A:^l2 9 0$:S!iSLT«lii^«$r^L. -eix^rSI 
*^T;^2 7 0bt^S|frt«.. flffl'J^^h 3 2 0(4. T 
3 1 4 m LTSi^^ 2 7 0b t^W^ I 
TV^Si^T'jbl). CC:T'{4 TMY KEYS (S.<7) 
a) J i/iMmX'$>ltLxmiR^ixt:. 
[00 68] y'^Xhmm3 16 TTAIPING (9 

^^y) J {4, ^3<r)imxi:mt-{^^x\^h, z.a)W. 
mx-))W<^mMi. ii^tT;i^2 7 0 ctctsiw$nTfc 

0. ^^)XV3 2 2\iZ'^ttlh. 
[006 9] lii!lA:»3^^§ll 3 7{4, im^}Xh3\ 
8. 320. 3 2 2i)>(,U'^^m-^^^^^L. ^m-^ 

^(nmwjxhiif^m'h. c:oji<a#w4. jMjrt^ 

X h 3 1 0 ^^^frs#«■&^i■coiEStt<o^|^?^|^® LT 
tr^. jMlRTdrXh3 1 0 tWStifflO^ *>±fi[iS:. 

h 3 3 0 1 1 4 7{C^-r*, 

[0 07 0] ia4{:fc«r^T. miiUx\-3 3 0<rym. 

a. i-if{4aii?T^xh$:«m-ri»ittTi=i.. m 

4tCfc<,^-C r<4. t:^:?'fBm4 1 0{:J: 0, 
rjOWN OF (<3DHr) J ^ TTHOUSAND (1^ 
E<^) J T'±S^ Ltz. ic^J^m, SS?T^X h 3 1 0 
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(ny-^Xhmms 1 2(4. T^;^M!3g3 1 2' {C®J« 
[007 1]*BSI4 10{4. #-^-H14 0i:'?'>;^ 

•fl)(K»{4*v\ CcolSlltil^U;^ h 3 18' 

■rs. i^'j;^h3i8' (4, m3(mm^jxh3i8 
izmtmh?>. mnvxhsia' a. tthousa 
ND (i^Eio) J i)m-(7)mmx'f>i. 
[0 0 7 2] mmifimi-i,t. 

i^fflU;^h3 1 8' ,3 20. 3 2 2*>4>lftf«''r 

f^fig-rs, c:^i<o«iffl{4. mwj :^h4 30 tlx =t- 

[0073] mtRT^X h 3 1 0 SrSJ^-rSl^t L 

i^ffluxh4 3o^^^$nT^^l>tc;)J-:I^'l•^J— 
•!f*««ll-f--^.tl^, ^—fimi/mvi7'^xh3i 0^ 

T^;?.h3 1 0{Cj)S TMYKEYS Jf-X: 

J TMONKEYs (^y^-x-.m j $r 
±m^^>(rtxmm4 1 2S:ita. -f-to^*, r^x 

h^^3 14' ^MONKEYS (II) J *«f ^XhS 
3R3 1 4 TMY KEYS j iz3.mhh. 

^Lxmi'jxh32oi,mnvxy-3 2o' izm^m 

h&, Z(7i^OXh320' l-yffyfmtiJ^t 

^j:'t'k(>mmMmmmmri*-(^x'f>h. mm 
X:hm-^i3 7ii,m/^mvxh3i8' , 32 
0' . 3 2 2<r)mmim^. miRT^xh3ioizn 
i-?>f^muxh4 3o' ^f^^-ri). 
1 0 0 7 4 ] zcom. :3—^iim^T^x h 3 1 0 fflco 

tX. mW^j:fm4 1 4 5r^ott. -?-«0«4 1 

4€:a!P^$Ht'cai!ii!"r4. i-s,tmmi4imifi 
v'^xh3io^zmtm>*). «^Hr^■f^xh4 1 

[00 7 5] 05tt03fc|B|«T*S*<. i^^^r 
;l^2 2 0S-iftf^S-t;t%^<0*^B^j|SC^$-.T^-r. 03 
iZjf;lfzXol>Z. i— f(4r^XhW^3 12.31 
4. 3 1 6*»/o^l.^S?r=¥Xh3 1 OSril!^. ttSSA 
:']te^Sl3 7{4, in'bx#xhS3g<7)S«=5rig|fflffl 
^S-f^«L, S!>?T^Xh3 10ffl4'IS<^'jXhS- 
f^^-r S . 1 3 7 {4, SSIl^Sr^-ri. w t 

i^ix. Mfe«t'iai^'jxh^g^swffi*T/i-2 2o^ 

[0 0 7 6] gMmi§^T;l'2 2 0{4, atRT^;^h3 

1 o:kmz!smmmwmi:mm-tizt izx o . t^^mm 
wjxh<D^m(ovmi:nwmi-i>. z<7)mfmii. 

ti. a^«WlS^T>P2 2 0J4, f^^imv Xhil^iSt^j: 
^^W^trmJlftthZtthh. e^«M^tr/U2 2 0 

{4. '^^mm^jxhrnmrna^t/^uzmimmiz 



l/sl37izmt. iK-^Sl 3 7{4, lgIEfg1i'jXh*^<i> 

[00773 smn^iffvmmzx o.m^jxh 

530{4, ia3cO<^ffl'J;^h330J:0^nTV^|,;i: 
{ie-5Ta-if{4. liffli'jXh5 3 0*»^>1^5 
0 2 i:mRT^ . ■m'r^X h 3 1 0 

ll50Wcfev>Ta— f{4. S!tRr^Xh310 
Sr<^ffl5 0 2 tl^^xSdiiHJ: ijjKLv^r^x h 5 
0 4S:f^|jg-ri.. 

[0078] Kmmtmeoy a-^^- h 

116(4. itT-:^t:9m-ti>f~:ibcr,it^&j)V-i-y6 0 
0<7)^XTyri7F:-tyn-^^r-hThh. dCO/l— 

f-v(4, 03. 5. I nzT^Ltimmimntifzi^ 

<^^fm^-^ti, ;W-f->'6 0 0(4. Xf >yr6 0 2(C 

^>7ET-:??:S(t)RI). A:>JM(4. ^S'lgiiro^^ 
i.ti/'a-;U2 3 0 . -t^isaSTn^^^At i;a-;P2 

4 0. x:hi^xTi^2 5ow<7)mmx:nMx'ij^\\ j> 

SV^{4^-#— H 1 4 Ofcit/V^x 14 2 S-ffioT^' 
^ty^A:»jL;t#}iSIA:/3T'tftvn hl^^im^l 
t:2r>&±<0imxijmiP^ff)f—i^X'i>!^\>\ j^L 
JtitilA:'5jlS*>4>c0T-:?O:^-lr. «*fllA:>]?S{4. 

[ 0 0 7 9 ] V- h-rn-fe .yi?-{4. «»coA:']ji*>4,7c 
T-^'i&S{tIRI.i:. X-r yr6 04(:fc»-^-C-?-«0T- 
:5'S'1SRA:']iSr^Xh?iJ(:^^-r5, -T^i?**. 7- 
Hro-fe-yi;-(4. 7CT-^'(:S:t*5^|.T^xh$r^L 
Tt-^'14 7(C^nr«6t-tS. Si5>(C7-Hrn-fe 
•y-9-(4. -eOx^XhcO^ifc^A^jiSrieiS-fa^ti^cO 

[ 0 0 8 0 ] X-f y r 6 0 6 (CfeV^t . J.-if(4je!S(C 

h(4, 1tStOA:)3jSSr*-ri.x^;^bT'tai<^. mHi 

X"7<yx:ii^'y^m-. '7-h'rD-b-/1?-(4. ■!-c;)l!JR 

[0081] XT !'r608(CfcUTV-K7*D-t .ytf 

(4, a^»?T=^fx^^^^^-rs ri^^j nvyn^rstt 

JX*. ^tl(CJEEtTV-Krn-fe-y-9-(4. Xr ;'r6 1 
0{3*j<,^T. «a<^A:J3aS(*»t6a!«?r'¥Xhffl<0li 

[0082] Xx yr6 1 2(Ct5V^-CV-KrD-fe /-9- 
(4. •eil^>li*l^*-:?14 7(C^-*-|>. 7-h'rn 

X. ^tihmSi^mmm.f,z^'fhzbmt.L\\ z 



[00 83] ;?,T /r6 1 4tiDl^Ta.-1fli. 7-K 
ya-t 'y^i,zo-7y\im-x.^. Z(Da'7yviimi 

s?T^:^ h9\^7'^xh(o-mz'?^xi:nmirir^^y 

i:Wt:ibizX Sfr^vi^T^x yMmmx'hi. 
10084] Xr yre 1 etijV^T. V-Krn-t-/ 

•yT6 18{=i3^^TV-HTD^r■y-»^{i, -e^OStRI^ 

XT«yr6 2 6{:J3V^t;P-f-y{if^T-r'S. Xr-yT 
6 2 6<0t*, ;^r>yr6 0 2^^S-^T/^-f•y^:^'3)i 

( 0 0 8 5 ] yre 1 6 Cfcv^Ta.-- 

7*6 2 OfcfcV^-C^-Krp-fe xifti, i— !f*<ajRr 
«y7•6 2 2^CfcV^TV-^'T^•fe.y■9■^±, 

— f-yjixr yre i otcyp-rt. lBlfef*a!i?-f 

[0086] Xr-yrs 20(:i3l,^T, Xr-yre 14 
HT'^ftmtf , 7- Hro-fe >y9-{±Xr -y re 2 4^ 

^•fs. z<^^-^j.—fii. miRT^xh^nmx"? 

4{ct>v^Ti^cO^S:4'llrU. Xf ^7*6 2 6tt5>,-> 
f^U-^y^r^Tti). ClCT'Sl^Xf •yr6 0 2(c;^ 

[0087] 07{i, 06c7);P-f->-6 1 OCOmm^^ 

izMlx. mii^M-thfztb(r)^xf-yri:ii^. m 

•fe y-trt^^rn/^A^i/A— :t^p-r-0^^i^ 

■< yi'y 3i-xi'hm.rnir^M,x'f>^, 

[0088] ;t—f->-<iXT-y 7*702 tm-f I.. CI 
i-CiiilA^Jfe^SHi, 3lt)?x^xh$r»WL, A:/3jS 
SiJcOx^xh^Stc^f^-rs. •r=5ri5-<p}tS!lA:>3*£^S 
{i, ^-^^ 1 4 7{c«^Lfex=SfXh<^#S<0A*ii^ 

hmm^z^m-t& zt^zxo. mmT^x hwi^cosn 

[0089] MM^izmktf^j:^ oiz. t^x hWm 



(a 2) )0 1-249920 (P200 1-249920A) 

<r)^mii^mx'hh. m7'-'9i,ziriUci>cr>tim^j:h 
m)j\,-i-yimmz^wt^'^mm . r^x h s 

ff)^(r>^mztmi^x-rit. 7t<r)^tmmbim^ 
&T^xh^mt^j:'oni. 

[0 090] m7t5Xl/m9iZt5\^X, r^XhS^ 

imxijmt:^-ti>Mi±^&T^xhtm^L. zcof- 
^xmma. ^^ty^'tj:o«*L./i7^='pxhsa 

tszbti^X'^i, 9^ v:ynzx 'ojtis.Ltim&T-^x 
hX'S)r,x. imxi}m^t^<r)7-^xhmmcmmxu 

(fium^j:hmmx:hmijit-th2^(r)f-^xh^m 

(omz. :j.-^tfi9^\fyyi>zi'>XT^xh^nXL 
fz^. ZcoWAUzT^XhU-^tiS^ifiT^xy-^ 
^X'f)&. 

[0091]Xryr704 ^zH^^xmxm-^H^ 

«i, ^n<^tmx:hMi^thT^xhmmiz'o\.^x. 

M^-flvm'fif/vi^mtl. xr>yr7 0 6(cfcv^ 
T»A:/5^^S(i, mm=eff\^i3^Xm-Thi>i)^ 

f}'i&i^-ti. ^xnmm=i:TJVifiWf-vhtn£. XT 

[0092] XT«yr7 0 8t:fcV^Tltfl!lA^I«r^« 

^ifz'km'iTmix^9m'^:^&-t?>. z 
co^^ma^mhtz^fz*), trnxm-^^i, 9 

-fTU:tT^fXM|5gffly-H2r4fi£-ri), 9Ari,fZ 

TdfxMSji. wkx-mkitb-^hT-^xv^^iz 

^t.Kt{i\-^, Xr yr7 1 0{Cfcv^rA:^fe-^J±, S 

3!»?r=¥xM1g|fflUxhS:f^jig-tS, {jctx-r yr? 
16{C^i). ClcOXr-yrfi. Xx-yr7 0 6<orN 

[009 3] Xf-yr7 0 6(C^0. Xr yr7 0 4-C 

^Ltz-^Tm^Tivifm-x-tamii. mkx-)M-^ 

mtXT-yri 1 2^mii-h, z<7)%^, :k^j:<tij 

1 o<0S|^* f h n Hfl 'J X h T-ife S . o T 

»S!lA:'iie^Ji, #1g^^'<xhni^ijxht|^ 

[0094] XT-yr7\A\.Zii\^xmmX^^-^^ 

(i. ^xh nmm^} xvt-^ArLfzT^xvmnti 
mm(r>hh^im.-^^mm-izt^z^-yx^') , it 

a!lA:>]|ig^g|Ji, #«^-tto(,^T. ^tRx^xhtc^i 

ixhm\,zm^iuw:th. iLxm.T^xvmw^ 

')XYim^-th, Xf-'/ri 1 Atl^hXr-'/Tl 1 6 
[0 09 5] Xx y7-7 10if^{i714*>4,Xr yr 

7 1 e/^jitfi^. «jB!A:>jie^i±. s«?7^*xbfflig| 
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[0 0 96] Xr-^ri Xr-yr 

T/Hi. immuzm-mrnxv^iimx-)}^^ 

fzim'jxh<r)^mm<nmmi:m^mLfz^^i'^tj. 
mMm'}xui^hkz. mm.T^xv\.zn\^xn?m 

s^jsms/saji, ufi^=tT'}\^tiim{^^xv^mz 

[ 0 0 9 7 3 y r? 2 0 (Z*JV^T»SiA:^©^S 

ri 1 6fcj:i^7 1 sfrUff uc*&. wmx-hm^^ 

^SfflUffl^riStR-ri.. Xt"jT1 1 6fcJ:t/7 1 8^11 

Ii7 1 4fc*JV^Tf^J«Uc«l*IUXI>*»i!,^fflli|ffl^ 
StR-fS. X-f y7'7 2 0«OSi, :^T-yr7 2 2tCt>V% 

[00 983 08{±, 07<7);l^-f-y7O4O#XT-y 
tr/l-SreiSgf XT-/r8 0 2tfcV^-C;l'-f-y£ 

siSftL. ^f^xvwi^^9mth>v-r\,zKh. x^t 

•yr8 0 2tcfci,^TMA:Jj^-^^{i, r^xhS** 
X=f •yr^QA\,Zii\^xmkKn^^\t. 

^mfr^x vw^if-mmr^x vmmxhht-^is^-kTik 

jrrs . *fiaT='E-x hS^T-^rtmtf. Xr .y rs O 6 

T-^xY'mmmmihhtz^. mskx-m-^mi. 

ryrtim^l,. 

[00 99] XT 77*8 0 4^^0. Xf- yr8 0 2-C 
a!lA:^3^-^«Xr-yr808S-||fi=-r.6. zcnxr-/ 

r[zii\.^xim?<f}m^mi. ■5-<^-f=^x hs^as^ 

U^^}ia!lA^iaS*>'o*>S*»S-^^1-|.. jtB^L/cttifflA 

■huij^iyX'hmi. m^x^^^mixr-yrs i osr 
m.Lfzm&x-nm(r>mtmmmxi]m^ 

m^X-nt^-^^itXT-vrs \2im^-fh. Xr-'vr 
QOQ\,Zii\^X. XT-yr8 0 2T-^L7tT^XhS 

mi^mLt:imxi)m>hX'^j:\^ni^\>. xr-yrs 

1 2^Jlt?. 



[0 1 003 XT-yr8 1 2\,zi5\^xmmx-h^^^ 

f**^-'-K-KfcJ;l^-7t^x$:ffifflLTr^x 

XT -y 7*8 1 4tt>v^-cttaiA:»je^{i. nmhn 
*<irF-c-*fi{f, m,T^xvmmi}>mw^tifzm^z 

y-KS:jijirrs. ii$^T;i'*<'<^xhnMffl'jxh-c 

A$it!tiS*ji3irtS.ri:l=J:0. 'Jxho#ii1f^j| 

[01013 XT-yrS 1 4*><oXT yr8 1 6^jl 
O.XT-y 7*8 1 2^Zii\^X:J.-^iii%m.T^XYmm 
S-feSLTV^^V^t^^L/cJ^t, XT>yr816^ 
ittf, XT y7-81 6(Cfc^^T, fiTSHA^jfe^gilS, X 

"f-vraoix'WiKLf^y-^xvwmzm.^ n^tT 
/ng**«i«^s . iKTdfx vw%ifim LtMm=t 

'fi\^t^^tzi,(^x'hti\i. im Lfzm^=t'TMiXT 

>y 7-8 1 0 -Cf^fig LTta^^^ t;1-T'^ I. . S Ac SIS 
T^X b g*3!)^§ iiT V ^il{f , XT y 7* 8 1 4 T'H 

mLtzmmm^Tfvx'hh, imuzm^^TMi. 

T;ui. mm^xhmmizmtimtir^/iX'^^ini 
s^w, ita!iA:>]»-^{i. ^^T^^xhmm^^tt 

i-hzti,x'^h, 

[ 0 1 0 2 3 XT-yr8 0 2-CSJ?Uv:t^X hW* 

i±, mmxM^^<^i><7)X'i>. mm^-tmm^T 

xhS*<ig^JiS!lT=¥xhSigt|Sia{c8io, ^^i?*> 

1 0 0%<Oil^Sr|flOi-CI.. «im:»]Sg^S{4. XT 
•y7*8 1 6cO»:(CXT-y7-8 1 8 SrUfff 1.. 
[0 1 033 XTyr818{4, Xf-yTSiettzti 
8 0 6{C^V>Tll1t-ri), Xf 'yT-SlStfclr^TjSaS 

x^i^-^mi. mK'r^xhp^iz^fm(7)7-^xhwm 
\i. )V-^y\ixf-'/rso2\iZ)V-rL. wmxiim 

[ 0 1 0 4 3 XT y 7-8 1 Sl.ziiUXW!Mcr>^^X h 

mmifim'Uxi.i. ^x^t^'^mi^ri^BycoxT-/ 
rQ2oimiL, umm=(:r)v^mmth. z<r>x 
T■y7•{cfcv^T^tayA:>7^^-^S5{i, ^^xv^mm^iZ^ 

tT/wi. -eiOT^xhs^fcSMmiiiggi^jSfflL, 

^«$:ffifflA:^l^^fstj5-r. XT-y 7-8 2 OtCfcV^T 
a^«MStT/H4. a5?T=JfXb^#:T-J4^<#T^X 
hS«fc*tLTirf^tl.?t4'5. XT-y 7*8 2 Oti, XT 

•y7-7 1 6feJ;lX7 1 8<0f^t LT*>SV^I4iilrat L 
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8 2 2X'm-th. 

to 1051 09 li. m8<7)JU-i-y810(O^XT-/ 

1 0 1 0 6 J fi-^ya 1 oiixfyrg o ix-mitkt 

^S^-rS. miIII@co;^T-yr9 0 2{Ci3V^-C. JtaiA 
:']1S^S(1, ^L>t«iaiA^l?l!?*»^>mifilifi!lA:>]?S5: 

[01071^1 wskx-hmtm.<r)<mk^\.. z 
XT yr9 02X'ii. mmxtim-^m. xr-^rg i 

«!IA:*]jSffl<^^^«lll'J;^hSrf^BgLTV^S. c:<ot8^ 

it«'J;?.Hi. S«?L/v:fSa!lA:'jjStB9iitl.i^^^ 

^•C'^5 0 , zti(:>miid:(7)mmx:hm<r)X:^t ixm 

t>ixh, Xr-yr9 04Ji, §!J?L^^flljiA:»3astM-f- 

^ci3^^■Cft■SlA;']^^^i. S!RL/cA:;]j®ffll^«oi 

[ 0 1 0 8 3 -yrg 0 6 fcJ3V^Ttia!IA:>]fe^S 
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1 Title of Invention 

METHOD AND SYSTEM FOR PROVIDING ALTHRNATIVES FOR TEXT DERIVED FROM 
STOCHASTIC INFUT SOURCES 

2 Clains 

(1) A conpuler-iiDpleiDented nethod for correcting text, coaprisiag 
the steps of: 

receiving a text selection coioprising a plurality of text 
components derived fioia different input sources: 

at least one of the text components coinprising a stochastic 
text component derived frota a stochastic input source or a series of 
stochastic input sources: 

receiving a command to display alternatives for the text 
selection: 

parsing the text selection into the text components; 

retrieving the stochastic model for the stochastic text 
component from its associated stochastic input source or series of 
input sources: 

combining the stochastic model i^ith other text components to 
produce a list of alternatives for the text selection; and 

displaying the list of alternatives for the text selection on 
a display device. 

(2) The method of claim 1. farther comprising the steps of: 

receiving a user command selecting one of the displayed 

alternatives; and 

replacing the text selection with the selected alternative. 
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(3) TLe method of clain 1, further cooiprising the steps of: 
receiving an edit lo the text selection; 

pfoducing a revised list of alternatives for ihe edited text 
selection; and 

displaying the revised list of alternatives for the edited 
text selection. 

(4) The nethod of claim 1. further comprising the steps of: 
receiving an edit to one of the stochastic text components; 
retrieving a revised stochastic model for the edited 

stochastic text coioponenl from its associated stochastic input 
source or series of input sources; 

combining the revised stocbastic model with another 
stochastic node! associated with the text selection to produce a 
revised list of alternatives for the edited text selection; and 

displaying the revised list of alternatives for the edited 
text selection. 

(5) The method of claim U wherein the text selection comprises a 
portion of text in a file within an application selected from the 
group consisting of a word processor, a spreadsheet, a browser, an 
electronic mail program, a music transcription program, a CAD 
program, a presentation program, and an operating system. 

(6) The method of claim I, wherein the step of displaying the 
alternatives for the text selection further comprises the steps of: 

ranking the alternatives for the text selection in 
probability order; and 
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displaying the altcrnatiyes in their rank order on the 
display device. 

(7) The method of claim 6, vhcrein the step of displaying the 
alternatives in their rank order further comprises the steps of: 

selecting a pre-'determined number of highest ranked 
alternatives; and 

displaying the selected alternatives in their rank order on 
the display device. 

(8) The method of clain 1. wherein the text selection comprises a 
plurality of stochastic text components and the step of combining 
the stochastic models further comprises the steps of: 

combining the stochastic mudels for each stochastic text 
component to produce an interim list of alternatives for the text 
selection; 

providing the interim list of alternatives to a natural 
language model: 

receiving a revised list of alternatives for the text 
selection from the natural language model, the revised list of 
alternatives comprising a reevaluation of the interim list of 
alternatives based on natural language principles applied by the 
natural language model lo the text selection as a whole; and 

displaying the revised list of alternatives as the list of 
alternatives for the text selection. 

(9) The melhod of claim 8. wherein the revised list of 
alternatives also comprises additional alternatives formed by the 
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natural language model that are not found in the interim list of 
alternatives provided to the natural language oodel. 

(10) The nethod of clain 8. further conprising the step of 
providing the stochastic model retrieved for one or more stochastic 
text components to a natural language model for reevaluation based 
on natural language principles. 

(11) The method of clain 8, further comprising the step of 
providing the stochastic model for each stochastic text coiopoflent to 
the natural language model for use in creating the revised list of 
alternatives. 

(12) The method of claim 1, wherein the text selection comprises a 
plurality of stochastic text components and the stochastic models 
for the text components comprise lattices, and wherein the step of 
combiaing the stochastic models to produce a list of alternatives 
for the text selection further coiaprises the steps of: 

concatenating the lattices into a metalattice that includes 
information about any text components that are derived from a non- 
stochastic source: and 

producing the list of alternatives for the text selection 
from the metalattice. 

(13) The method of claim I. wherein the text selection comprises a 
plurality of stochastic text components and one of the stochastic 
models comprises an "n-best" candidate list and anotlier stochastic 
model comprises a lattice, and wherein the step of combining the 
stochastic models to produce a list of alternatives for the text 
selection further comprises the steps of: 
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creating an "n-bes(" candidate list corresponding to the 
lallice; and 

producing the list of alternatives for the text selection by 
conblning the "n-best" candidate lists for the text components. 

(14) The method of clais \, wherein the step of retrieving a 
stochaslic mode! for a text conponenl originating froin a stuchastic 
input soorce further coaipriscs the steps: 

detemining if the text component is derived from stochaslic 
input sources configured in series; 

if the text component is derived from stochastic input 
sources configured in series, deriving a series stochastic model by 
combining together a stochastic model from each stochastic input 
source in the series; 

retrieving the series stocliastic model as the stochastic 
model for the text conponent. 

(15) The method of claim 1, wherein the step of retrieving the 
stochastic model for the stochastic text component from its 
associated stochastic input source or series of input sources 
comprises the steps of: 

receiving user Input into a first stochastic input source in 
a series of stuchastic input sources: 

selecting a stochastic result comprising a plurality of 
alternatives produced by the first stochastic input soorce; 

producing a plurality of stochastic results for a second 
stochastic input source in the series by using each alternative of 
the stochastic result produced by the first stochastic input source 
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as input into the second stochastic input source to produce a 
stochastic result for the second stochastic input source: 

if any stochastic result for the second stochastic input 
source does not comprise an "n-bcst" alternatives list, converting 
that stochastic result to an "n-best" alternatives list; 

combining the stochastic results for the second stochastic 
input source to create a totalized alternatives list for the second 
stochastic input source. 

(16) The method of claim 15, wherein the step of combining the 
plurality of stochastic results for the second stochastic input 
source to create the totalized alternatives list for the second 
stochastic input source further comprises the steps of: 

creating a single entry in the totalized alternatives list 
for each unique alternative appearing in the plurality of stochastic 
results for the second stochastic input source: 

calculating a probability for each alternative in the 
totalized alternatives list by summing all probabilities assigned to 
that alternative in the plurality of stochastic results for the 
second stochastic input source; and 

assigning each calculated probability to its associated 
alternative. 

(17) A computer-readable aediun having computer-executable 
instructions for performing the method of claim 1. 

(18) A computer adapted to perform the method of claim 1. 

(19) A computer-readable medium having computer-executable 
instructions for performing the nethod of claim 16. 
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(20) A computer adapted to perforn the method of claiD 16. 

(21) A coDiputer'-inpleiDented nethod for correcting text, comprising 
the steps of: 

receiving a text selection from a user; 
receiving a coEisand to display alternatives for the text 
selection; 

submitting the text selection lo a correction scope nodel to 
determine if a scope of correction should be adjusted; 

receiving from the correction scope model a text unit that 
includes the text selection and at least one adjacent word; 

producing a list of alternatives for the text unit; and 

displaying the list of alternatives for the text unit on a 
display device. 

(22) The method of claim 21. further comprising the steps of: 
receiving a user command selecting one of the displayed 

alternatives; and 

replacing the text unit with the selected alternative. 

(23) The method of claim 21. wherein the adjacent word is 
incorrect because of a related error that caused a word within the 
text selection to be incorrect. 

(24) The method of claim 21, wherein the correction scope iBodol 
includes criteria selected from the group consisting of a natural 
language model, a model of likely errors, an acoustic model, a 
handwriting model, and a vision-based model. 
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(25) The method of claiin 21, wherein the step of producing a list 
of alternatives for the text unit further conprises the steps of: 

parsing the text unit into text conponcnts derived fioio 
differeut input sources: 

determining if one of the text components comprises a 
stochastic text component; 

if the one of the text components comprises the stochastic 
text coDponent, retrieving a stochastic model for the stochastic 
text component; and 

combining the stochastic model with other text components to 
produce a list of alternatives for the text unit. 

(26) A computer-readable medium having computer-executable 
instructions for performing the method of claim 21. 

(27) A computer adapted to perform the method of claim 25. 

3 Detailed Description of Invention 
FIELD OF THE INVENTION 

The invention relates generally to methods for entering text 

into a computer and, more particularly, relates to providing 

alternatives for a text selection derived from multiple stochastic 

input sources. 

BACKGROUND OF THE INVENTION 

Computer users have traditionally entered text into word 
processors through a keyboard and mouse. In recent years, however, 
word processors have become more sophisticated by allowing users to 
enter text into them through other input methods, such as speech or 
handwriting. Although a computer cannot always interpret such input 
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with complete accuracy, a computer can generate a list of text 
alternatives for the input, h'urtherniore, the computer can often 
assign to each alternative a probability that the alternative is the 
one the user Intended. Input that produces such probabilistic 
results is called "stochastic input." while input that can be 
accurately deternined, such as typed text, is called "non- 
stochastic input. " 

Typically, text produced in a word processor from stochastic 
Input must be heavily edited by the user in order to produce the 
text intended by the user when he or she created the stochastic 
inpui. The editing process has been simplified by allowing the user 
to select text created from stochastic data and request alternatives 
for the text selection. In response, the computer can provide the 
user with alternatives for the text selection through a graphical 
user interface. If the user chooses one of the alternatives, the 
computer replaces the text selection with the selected alternative. 

Suppose that after a user creates text in a word processor by 
providing the word processor with stochastic input, such as speech, 
the user then edits the text. The user nay, for example, replace a 
word of the text with a new word typed into the computer with a 
keyboard. Current word processors do not incorporate typed text 
edits into the alternatives they provide for an edited text 
selection. Thus, there is a need in the art for a method of 
providing alternatives to edited text derived from stochastic input. 

Another problem occurs if the user attempts to request 
alternatives for a text selection spanning multiple stochastic input 
sources. For inslance. the user may request alternatives for a text 
selection containing a word based on handwriting input and a word 
based on speech input. Current word processors are not capable of 
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providing meaningful alternatives for sach a text selection. Thus, 
there is also a need in the art for a method of providing 
alternatives for a text selection derived from multiple slochastic 
input sources. 

An input melbod editor (IKE) is another word processor input 
method that produces stochastic data Generally, an IME converts 
input into foreign langnage text. The input into an liiE may. for 
example, be typed text entered into the computer through a keyboard 
and mouse. An IME is especially useful for creating ideograms in 
Asian and other languages. Because there are many more ideograms in 
such languages than there are keys on a keyboard, entering a 
particular ideogram into the computer typically requires multiples 
keystrokes, which the lUB interprets as a composed character. 

In a typical IME. a user may type in English characters 
defining a phonetic spelling for a desired Chinese character. Since 
many Chinese characters have similar pronunciations, the typed 
phonetic spelling may represent any one of a number of different 
Chinese characters. The IME then provides the user with the most 
probable candidates corresponding to the typed phonetic spelling so 
that the user can choose the correct one. 

Programmers have previously recognized the value of providing 
speech input into an IME. This is done by first converting the 
speech into text, which is then used as input into the IHB. As has 
already been explained, however, the interpretation of speech is 
stochastic in nature. Hence, the text produced by the speech 
interpreter may not be the text that was intended by the user. If 
incorrect tt;xt is used as Input into the IME. the results produced 
by the IME are likely to be poor. Accordingly, when speech is used 
as input into an IME. the program interpreting the speech data 
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typically allows tbe user to first correct the text produced by the 
speech interpreter before inputting that text into the IHE. When 
the INK produces foreign language translations of the text, the user 
nay again choose the desired alternative because the result of an 
IME is also stochastic in nature. Requiring the user to edit the 
results at tvo different stages of the process can be inefficient 
and inconvenient. Thus, there is a further need in the art for an 
improved nethod of handling speech input to an IME. 

SUMMARY OF THE INVHNTION 

The present invention neets the needs described above in a 
stochastic input combiner that facilitates the editing of text. The 
invention does this by providing alternatives for a text selection 
made by the user, even where that text selection is derived froni 
multiple input sources, one or more of which can be stochastic in 
nature. 

The stochastic input combiner provides the alternatives to 
the user in list form through a graphical user interface. The user 
can then choose one of the alternatives to replace the text 
selection the user has highlighted for editing. This can often be 
quicker than requiring the user to think of alternatives on his or 
her own and then make changes accordingly using a keyboard and mouse. 
If the user does not find an alternative the user likes for the text 
selection, the user can edit the text selection using the keyboard 
and mouse. In response, the stochastic input combiner can provide 
an updated list of alternatives that incorporate the user' s changes. 
Often, the user need only partially edit the text selection before 
the stochastic input combiner produces an alternative tbe user likes, 
so the stochastic input combiner again improves editing efficiency. 
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The stochastic input conbincr may also provide the advantage 
of a natural language nodel. Such a aodel may analyze the text 
selection as a whole using natural language principles in order to 
provide a better list of altcinatives for the text selection. This, 
too, inprovcs the user' s editing efficiency. 

Generally described, the present invention includes a aethod 
for correcting text. The user first enters text into the conputer, 
perhaps using nultiple stochastic input sources. The user nay also 
use a keyboard and nouse to enter text into the conputer. 

Keyboard/DOUse entry of text is an example of an input source 
which is non-stochastic, neaning that the text intended to be 
entered through the source can be determined with cosplete accuracy. 
On the other hand, a stochastic input source is one that converts 
input into a list of alternatives, each having less than lOOX 
probability of being the correct alternative. Because speech cannot 
always be interpreted by a computer with complete accuracy, a speech 
recognition unit is an example of a stochastic input source which 
converts speech input into a list of text alternatives. Other 
examples of a stochastic input source include a handwriting 
recognition unit and an input method editor (IME). Vhere an input 
source for a text component is stochastic, the most likely 
alternative for the text component is generally ust^d to represent 
the text component in the text selection. 

Once the user enters text into the conputer, the user can 
begin the correction process by making a text selection of a portion 
of the text the user entered. This text selection can include 
multiple text components. Such a text component can be a subset of 
the text selection that the user entered through a single input 
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source* The user may have entered different text components Titiiin 
the text selection using different input sources, and the user may 
have entered one or more of the text components with a stochastic 
input source. 

Once the user makes a text selection, the user nay enter a 
command to display alternatives for the text selection as a whole. 
The stochastic input combiner then parses the text selection into 
its text components and retrieves a stochastic model representing 
the alternatives for a text component originating from a stochastic 
input source. This stochastic model may include a list of the 
alternatives for the text component together with the probabilities 
associated with the alternatives. Alternatively, the stochastic 
model nay include a lattice. 

The stochastic input combiner then combines the stochastic 
model retrieved with other text components to produce a list of 
alternatives for the text selection as a whole. The stochastic 
input combiner then displays this list of alternatives for the text 
selection on a display device, such as a monitor. The user may then 
select one of the displayed alternatives. In that case, the 
selected alternative replaces the text selection. 

The stochastic input combiner may also utilize a natural 
language model. In this alternative, the stochastic input combinci 
may combine the stochastic models for each stochastic text component 
to produce an interim list of alternatives for the text selection 
that the combiner provides to the natural language model. The 
natural language node! forms a revised list of alternatives by 
reevaluating Ihe interim list of alternatives based on natural 
language principles applied by the natural language model to the 
text selection as a whole. The natural language model may also add 
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nev alternatives to the revised list of alternatives that are not 
lound in the interim list. After the natural language model returns 
the revised list of alternatives to the stuchastic input conbiner. 
the stochastic input combiner provides the revised list of 
alternatives for display. 

In another alternative, a series of stochastic input sources 
creates a stochastic text component. This means that at least one 
stochastic input source produces a stochastic result that serves as 
input Into a second stochastic input source. Typically, the first 
stochastic input source of the scries requires user input, vhile 
subsequent stochastic input sources in the series receive an 
alternative produced by the previous stochastic input source as 
input. An example of this is a speech recognition unit that 
produces text used as input into an IME. When stochastic input 
sources in series produce a stochastic text component, the 
stochastic input combiner can produce a series stochastic model that 
contains alternatives for the stochastic text component and accurate 
probabilities for those alternatives that account for the likelihood 
of the inputs into each stochastic input source in the series. This 
process eliminates any need for the user to choose a single 
alternative produced by a stochastic input source to use as input 
into a subsequent stochastic input source. 

To produce a series stochastic model for a text component 
derived from a scries of stochastic input sources, the stochastic 
input combiner first submits to the first stochastic input source in 
the series the user input intended for that stochastic input source. 
By processing the user input, the first stochastic input source 
produces a stochastic result. The stochastic result that the first 
stochastic input source produces has multiple alternatives, and the 
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stochastic input combiner selects that stochastic result. By using 
each alteraative of Ihe selected stochastic result as input into the 
second stochastic input source to produce a stochastic result for 
the second stochastic input source, the stochastic input combiner 
produces multiple stochastic results, each stochastic result having 
Dulliple alternatives, for the second stochastic input source. If 
any stochastic result for the second stochastic input source does 
not contain an '^n-best" alternatives list, the stochastic input 
coDbiner converts that stochastic result into an "n-best" 
alternatives list because converting all stochastic results into the 
sane fornat sinpllfies the process of combining them. The 
stochastic input combiner then combines the stochastic results for 
the second stochastic input source lo create a totalized 
alternatives list for the second stochastic input source. If there 
are only two stochastic input sources in the series, then the 
totalized alternatives list may serve as the stochastic model for 
the teit component resulting from the series. 

The stochastic input combiner may also be functional for 
expanding the scope of correction for a text selection received from 
a user to a larger text unit. To do this, the stochastic input 
combiner submits the text selection to a correction scope model to 
malce the determination of whether the scope of correction should be 
adjusted. In response to submitting the text selection, the 
stochastic input combiner receives from the correction scope model a 
text unit thai includes the text selection and at least one adjacent 
word. Using the text unit, the stochastic input combiner can then 
produce a list of alternatives for the text unit and display those 
alternatives on a display device. 
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The various aspects of the present invention nay be nore 
clearly understood and appreciated from a reviev of the folloving 
detailed description of the disclosed cabodinents and by reference 
to Ibe appended dravlngs and clains. 

DHTAILED DKSCRIPTION OF ITE EXEMPLARY EMBODIMENTS 

The present invention is typically embodied in a word 
processor tliat can receive input from multiple sources, each of 
which can be a non-stochastic input source or a stochastiu input 
source. Keyboard/nous e entry of text Is an example of an Input 
source which is non-stochastic, meaning that the computer can 
determine the text the user intended with complete accuracy. On the 
other hand, a stochastic input source is one that converts input 
into a stochastic result. A stochastic result is one bavins 
multiple alternatives, each having less than lOOX probability of 
being the correct alternative. An example of a stochastic input 
source is a speech recognition unit, which converts speech input 
into a list of text alternatives since a computer cannot always 
interpret speech with coDU)lete accuracy. Other examples of a 
stochastic input source are a handwriting recognition unit and an 
input method editor (IME). 

Tbc word processor is functional for allowing the user to 
select a section of text and to request alternatives for that 
selection. If the computer has created the text selection from one 
or more stochastic input sources, there will be alternatives for the 
text selection. After the word processor determines alternatives 
for the text selection, the word processor displays the alternatives 
through a graphical user interface. If the user chooses one of the 
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alternatives for the text selection, then the woi'd processor 
replaces the text selection with the chosen candidate. 

After exanining the list of alternatives that tie vord 
processor provides, the user may not find an acceptable alternative 
for the text selection. Hence, the vord processor may allow the 
user to edit the text selection using a keyboard and nouse. The 
user nay. for example, change one of the words in the text selection. 
In that case, the word processor may then revise the list of 
alternatives to incorporate the edit and provide the revised list of 
alternatives to the user. If the user chooses one of the revised 
alternatives for the text selection, then the word processor 
replaces the text selection with the chosen alternative. 

A progran nodule called a stochastic input combiner is 
typically responsible for producing the alternatives for a text 
selection. The stochastic input combiner does this by parsing the 
text selection into smaller text components derived from no more 
than one stochastic input source. For each stochastic text 
component, the stochastic input combiner then retrieves a stochastic 
model representing the alternatives for the text component. Then, 
the stuchastic inpul combiner can combine the stochastic models 
retrieved with other text components to produce a list of 
alternatives for the text selection as a whole. 

The stochastic input combiner can be part of the word 
processing application. Alternatively, the stochastic input 
combiner can be a separate utility that is part of the operating 
system. The combiner could also be a separate program that 
interfaces with the word proccsisor but that is not part of the 
operating systea 
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To inprove the list of alternatives the word processor offers 
for a text selection, the vord processor nay use a natural language 
model. The natural language model aay apply natural language 
principles to the text selection as a whole to reevaluate the 
likelihood of the alternatives in the alternatives list produced by 
the stochastic input conbiner and to add new alternatives to the 
alternatives list. Alternatively, the natural language loodel nay 
apply natural language principles to individual text components to 
improve the stochastic models that the stochastic input combiner 
uses to produce the alternatives list. 

A text component is occasionally derived from a series of 
stochastic input sources. This means that at least one stochastic 
input source pruduces a stochastic result that serves as input into 
a second stochastic input source. The first stochastic input source 
in the series typically receives user input, while the last 
stochastic input source in the series produces the alternatives for 
the text component. The stochastic input conbiner can derive a 
series stochastic nodel containing alternatives for such a text 
component without requiring the user to select a single alternative 
result for each stochastic input source in the series as input for 
the subsequent stochastic input source. 

To derive a series stochastic model for a text component, the 
stochastic input combiner first selects the user Input that 
ultimately produced the text component. The stochastic input 
combiner then submits the selected user input to the first 
stochastic input source in series order. The stochastic input 
combiner then submits each alternative produced by the first 
stochastic input source as input into the subsequent stochastic 
input source in series order. Because the subsequent stochastic 
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input soarce produces a stochastic result fron each alternative 
subnitted to it. the stochastic results of the snbsequent stochastic 
Input source must be combined into a totalized candidate list. If 
there is yet another stochastic input source in the series, the 
stochastic input conbiner subaits each candidate of the totalized 
candidate list as inpnt into that next stochastic Input source. The 
totalized candidate list that the last stochastic input source of 
the series produces in this fashion is the series stochastic model. 

Often, stochastic Input sources make an error vbich spans 
multiple words. During the correction process, a user nay not 
notice the full extent of that error. For example. If the user 
dictates the word ''recognize." the speech recognition engine might 
conclude that the most likely interpretation of the speech input is 
"wreck a nice." While editing, the user might see the word 
"wreck" and request alternatives only for that word because the 
user did not notice that the following words were also incorrect 
(i. e. . "a nice" ). 

If a user makes a text selection that does not include 
adjacent words that are incorrect because of a related error and if 
the word processor only uses the user' s text selection to produce 
an alternatives list, none of the alternatives offered for the text 
selection may be the text the user intended at the time of input. 
Similarly, replacing the text selection with an alternative chosen 
by the user from the alternatives list would leave the incorrect 
adjacent words in the text. 

To eliminate these disadvantages, the stochastic input 
combiner may submit the text selection to a correction scope model 
which determines If the scope of correction should be expanded. In 
the "recognize" example, an appropriate text unit for correction 
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would be "wreck a nice." To nake this detemination, the 
correction scupe Qodel nay drav on information iiicluded in a natural 
language model, models of likely errors, and models tied to the 
input methods used to produced the text in the word processor. The 
models associated with the input methods may include acoustic models 
for speech recognition, handwriting models for handwritten input, 
and vision-based models for recognizing sign language or other 
gestures. 

If the correction scope model determines that the scope of 
correction should be adjusted, the correction scope model identifies 
one or more larger text units for which the stochastic input 
combiner should produce alternatives in the manner already described. 
The correction scope model sends a list of these text units to the 
stochastic input combiner for processing. 

Turning now to the figures, in which like numerals refer to 
like elements throughout the several figures, aspects of the present 
invention will be described. 

Exemplary Operating Environment 

FIG. 1 and the following discussion are intended to provide a 
brief and general description of a suitable computing environment 
100 for an implementation of the present invention. The exemplary 
operating environment 100 includes a conventional personal computer 
system 120, including a processing unit IZl, a system memory 122. 
and a system bus 123 that couples the system memory 122 to the 
processing unit 121. The system memory 122 includes read only 
memory (ROM) 124 and random access memory (RAM) 125. A basic 
input/output system 126 (BIOS), containing the basic routines that 
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help to transfer information between elements within the personal 
computer systen 120. such as during start-up. is stored in ROM 124. 

The personal conputcr systen 120 further includes a bard disk 
drive 127. a magnetic disk drive 128, e.g.. to read from or write to 
a rcBOvable magnetic disk 129, and an optical disk drive 130. e.g., 
for reading a CD-ROM disk 131 or to read from or write to other 
optical media. The bard disk drive 127. magnetic disk drive 128» 
and optical disk drive 130 are connected to the system bus 123 by a 
hard disk drive iaterface 132. a magnetic disk drive interface 133. 
and an optical drive interface 134, respectively. The drives and 
their associated computer-readable media provide nonvolatile storage 
for the personal computer system 120. Although the description of 
computer-readable media above refers to a hard disk, a removable 
magnetic disk and a CD-ROM disk, it should be appreciated by those 
skilled in the arl that other types of media that are readable by a 
computer system, such as magnetic cassettes, flash memory cards, 
digital video disks. Bernoulli cartridges, and the like, may also be 
used in the excmplaiy operating environment. 

A user may enter commands and information into the personal 
computer 120 through conventional input devices, including a 
keyboard 140 and pointing device, such as a mouse 142. A microphone 
161 may be used to enter audio input, such as speech, into the 
computer system 120. A user may enter graphical information, such 
as drawings or handwriting, into the computer system by drawing the 
graphical information on a writing tablet 162 using a stylus. The 
computer system 120 may include additional input devices (not shown), 
such as a joystick, game pad. satellite dish, scanner, touch 
screen/stylus, or the like. The microphone 161 can be connected to 
the process in.(^ unit 121 through an audio adapter 160 that is coupled 
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lo the system bus. The other input devices are often connected to 
the processing unit 121 through a serial port interface 146 that is 
coopled to the systeo bus, but laay be connected by other interfaces, 
such as a game port or a universal serial bus (USB). 

A aonitor 147 or otber type of display device is also 
connected to the system bus 123 via an interface, such as a video 
adapter 148. In addition to the monitor, personal computer systems 
typically include other peripheral output devices (not shown), such 
as speakers or printers. 

The personal computer system 120 may operate in a netvorked 
environment using logical connections to one or more remote computer 
systems, such as a remote computer system 149. The remote computer 
system 149 may be a server, a router, a peer device or other common 
network node, and typically includes many or all of the elements 
described relative lo the personal computer system 120, although 
only a memory storage device ISO has been illustrated in FIG. 1. 
The logical connections depicted in FIG. 1 include a local area, 
network (LAN) 161 and a vide area network (WAN) 152. Such 
networking environments are commonplace in offices, enterprise-wide 
computer networks, intranets and the Internet. 

When used in a LAN networking environment, the personal 
computer system 120 is connected to the LAN 151 through a network 
interface 153. When used in a VAN networking environment, the 
personal computer system 120 typically includes a modem 154 or other 
means for establishing communications over a WAN 162. such as the 
Internet. The modem 154. which may be internal or external, is 
connected to the system bus 123 via the serial port interface 146. 
In a netvorked environment, program modules depicted relative to the 
personal computer system 120. or portions thereof, may be stored in 
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the remote meinory storage device 150. It will be appreciated that 
the network connections shown arc exenplary and other neans of 
establishing a comiQuni cat ions link between the computer systens may 
be Qsed. li will be further appreciated that the invention could 
equivalently be inplemented on host or server compater systens other 
than personal computer systems, and could equivalently be 
transmitted to the host computer system by means other than a CD-ROU 
for example, by way of the network connection interface lb3. 

A number of program nodules may be stored in the drives and 
RAM 125 of the computer system 120. Program modules control how 
the computer system 120 functions and interacts with the user, with 
I/O devices or with other computers. Program modules include 
routines, operating system 135. application program modules 138. 
data structures, browsers, and other software or firmware components. 
The invention may conveniently be implemented in one or more program 
modules, such as a stochastic input combiner program module 1S7 and 
a stochastic input interface program module 139, each of which Is 
based upon the methods described in the detailed description. 

The application program modules 138 may comprise a variety of 
applications used in conjunction with the present invention, some of 
which axe shown in FI(L 2. The purposes of and interactions between 
some of these program modules are discussed more fully in the text 
describing FIG. 2. These Include a word processor program 210 (such 
as WORD, produced by Microsoft Corporation of Redmond, WA). a 
handwriting recognition program module 230, a speech recognition 
Mrogran module 240. and an input method editor (IME) 250. 

No particular programming language will be described for 
carrying out the various procedures described In the detailed 
description because it is considered that the operations, steps, and 
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procedures described and illustrated in the accooipaDying drawings 
are sufficiently disclosed to permit one of ordinaiy skill in the 
art to practice an exemplary embodinent of the present invention. 
Moreover, there are many computers and operating systems which nay 
be used in practicing an exemplary embodiment, and therefore no 
detailed computer proc;ras could be provided which would be 
applicable to all of these many different systems. Each user of a 
particular computer will be aware of the language and tools which 
are DOSt useful for that user's needs and purposes. 

Those skilled in the art will appreciate that the invention 
may be practiced with other computer system configurations, 
including hand-held devices, multiprocessor systems, microprocessor- 
based or programmable consumer electronics, minicomputers, mainframe 
computers, and the like. The invention may also be practiced in 
distributed computing environments where tasks are performed by 
remote processing devices that are linked through a communications 
network. In a distributed computing environment, program modules 
may be located in both local and remote memory storage devices. 

Overview of Program Modules 

FIG. 2 provides an overview of the program modules of a 
multi-source data processing system 200. Generally, the program 
modules shown in FIG. 2 enable a user to enter text into an 
application 210. such as a word processor, using both stochastic and 
non-stochastic input sources. Typical stochastic input sources 
include a handwriting recognition program module 230. speech 
recognition program module 240, input method editor (IME) 250. and 
speech recognition program module 260. A keyboard 140 is a typical 
source of non-stochastic data. Once the user enters text into the 
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word processor 210 through one or more of these input sources, the 
user nay then select a section of text and request a candidate list 
of alternatives for that text selection. The text selection may 
contain input from Bultlple stochastic and non-stochastic input 
sources. As long as the text selection is derived fron at least one 
stochastic input source, there vUl be alternatives for the text 
selection. The program nodules can produce this candidate list and 
present them to the user through a graphical user interface. If the 
user chooses one of the candidates, the text selection is replaced 
with the chosen candidate. The operation of stochastic input 
sources 230, 240, 260. and !60 are now discussed in turn. 

The handwriting recognition program module 230 receives 
handwriting input 280 from the user. The user may generate the 
handwriting input 280 by writing on the writing tablet 162 with a 
stylus. Alternatively, the user may generate hundwriling input 280 
using other devices. For instance, the user may write on the 
monitor 147 with a mouse 142. or the user may write on a touch 
screen using a stylus. After input, the handwriting input 280 Is 
preferably directed to the handwriting recognition program module 
230 by a writing tablet driver nodule in the operating system 135. 

As handwriting is often difficult for a computer to interpret, 
the handwriting recognition program module 230 cannot always 
decipher the handwriting input 280 with complete accuracy. The best 
the program nodule 230 can do is to generate alternatives for the 
handwriting input 280 and assign a probability that each alternative 
is the correct one: By definition, then, the handwriting 
recognition program module 230 generates a stochastic result. The 
stochastic model 270a includes a data structure containing the 
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stochastic data produced by processing handwriiins input 280 with 
the handwriting recognition program nodule 230. 

Although any data structure capable of storing stochastic 
data can contpiise a stochastic model 270, two useful structures for 
doing so are a lattice and an "u-best" alternatives list. A 
lattice is a structure that is well known to those skilled in the 
art, so a complete description will not be given. Briefly, however, 
a lattice stores words or phrases produced by a stochastic input 
source in nodes. Because each word or phrase is stochastic data, 
the node also stores a probability assigned to the associated word 
or phrase. Using methods well known to those skilled in the art, 
the lattice can be traversed in order to produce likely alternatives 
for any section of text represented by the stochastic data. 
Furthermore, lattices repiesentins adjacent pieces of text can be 
combined into a metalattice through a process known as concatenation. 
The metalattice can then be traversed to produce alternatives for 
the adjacent pieces of text. 

Alternatively, stochastic data nay be represented by a list 
of the n-best alternatives and their associated probabilities. For 
any given word or phrase, an n-best alternatives list may be 
produced from a lattice representing the word or phrase. 

The speech recognition program module MO works like the 
handwriting recognition program module 230. except that it receives 
speech input 290 from the user through a microphone 161 run by a 
microphone driver module in the operating system 13b. Speech is 
often difficult to interpret because many words that sound alike 
have different meanings and spellings, so the speech recognition 
program nodule 240 also produces a stochastic resolt. The 
stochastic model Z70b stores the data structure containing the 
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stochaslic data produced by processing speech input 290 with the 
speech recognition progran module 240. 

An input nelhod editor (IME) 250 also generates stochastic 
data. Generally, an IME ?50 converts input into foreign language 
text. The input into an 1MB 250 may, for exanple, be typed text 
entered into the computer through a keyboard 140 and mouse 142. The 
stochastic Dodel 270c includes a data structure containing the 
stochastic data produced by the IME 2s>0. 

An IME 250 is especially useful for creating ideograDs in 
Asian and other languages. Because there arc many more ideograns in 
such languages than there are keys on a keyboard, entering a 
particular ideogram into the computer is problematic vithout an 1MB 
250. In a typical 1MB 2b0. a user types in Knglish characters a 
phonetic spelling for a desired Chinese character. Since many 
Chinese characters have similar pronunciations, the typed phonetic 
spellings may represent any one of a number of different Cliinese 
characters, and the IME 260 produces a stochastic result. The IME 
200 then provides the user vith the most probable candidates 
intended by the typed phonetic spelling so that the user can choose 
the correct one. 

The stochastic results produced by one stochastic input 
source may serve as stochastic input to a second stochastic input 
source. Then this is the case, the stochastic input soorces are 
"series stochastic input sources," and the stochastic input 
sources can be described as configured "in series," This is 
lllustraied by the configuration 293 of program modules, which also 
demonstrates another embodiment of au IMH 250. 

In this embodiment, English speech input 262 may be entered 
into the computer and used to produce Japanese text. The speech 262 
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is first subnitled to a speech recognition program module 2G0, In 
operation, the speech recognition progran module 260 functions much 
like the speech recognition program module 240. but it is 
illustrated as a distinct unit because it nay have a different 
speech interpretation engine. For example, the speech recognition 
program module 250 nay interpret a different language than the 
speech recognition program module 240. The stochastic model 270d 
includes a data structure containing the stochastic data produced by 
processing speech input vltb the speech recognition program module 
260. 

In an English speech/Japanese IME example, the speech 
recognition program module 260 may produce English text alternatives 
from the spoken English vords and store (hem in the stochastic model 
270d. One or more of the English language text alternatives stored 
in the stochastic model 270d can then be used as input into (he IME 
250. vhich translates the English language text input into Japanese 
characters. Each alternative input into the lUB 2b0 produces a 
separate stochastic result, though it should be appreciated that 
there may be overlap between (he alternatives forming the stochastic 
results of two distinct inputs into the IME 250. 

Though the arrow in FIG. 2 from the speech recognition 
program module 260 to 1MB 250 illustrates that the speech 
recognition program module is a stochastic input source for the IME 
250, it should be understood that the two program modules nay not 
interface directly. Thus, for example, stochastic input from speech 
recognition program nodule 260 to IMH 250 nay travel through an 
interface program module, such as stochastic input interface 139. to 
which each stochastic input source is directly connected. 
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A stochastic input interface 139 serves as a conduit for 
stochastic data between an application 210 that is to receive 
stochastic data and a stochastic input source, such as handwriting 
recognition program nodule 230, speech recognition progran nodule 
240. or IME 250. One advantage of having a stochastic input 
interface 139 as a conduit for stochastic data is that it sioplifies 
conuDunlcation between the application 210 receiving the stochastic 
data and the stochastic input sources. That is, the application 
only needs to know how to conmunicate with the stochastic input 
interface instead of all possible stochastic input sources. The 
application 210 that is to receive stochastic input is a word 
processor in an exemplary embodiaent of the present invention. 
However, the application 210 could also be a spreadsheet, browser, 
electronic mall program, music transcription program. CAD program, 
presentation software (such as PowerPoint, produced by Microsoft 
Corporation of Kedmond, Washington), operating system, or other 
software program. 

In the word processor embodiment, the word processor 210 
receives, though stochastic input interface 139. text representing 
the most likely alternative from each stochastic input source used 
to enter data into the word processor. In addition to transmitting 
data into the word processor 210 through multiple stochastic input 
sources, the user may also enter typical non-stochastic data into 
the word processor, such as by typing on a keyboard 140. The word 
processor 210 combines all this source data into a nulti-source text 
string that is presented to the user. Although the word processor 
210 docs not indicate to the user the source of each word of the 
text, the word processor nonetheless maintains a record of the 
source of each component of the text. 
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The word processor 210 is also functional for allowing the 
user to identify a section of text and to request alternatives for 
that selection. If the text selection is derived fron one or nore 
stochastic input sources, there will be alternatives for the text 
selection. The word piocessor 210 can request a candidate list of 
alternatives fron the stochastic input interface 139 by providing It 
with the text selection and the sources of each of the components of 
that text selection. After the request is processed, the stochastic 
input interface 139 provides the word processor 210 with a candidate 
list for the entire text selection. The word processor 210 provides 
the candidate list to the user through a graphical user interface. 
If the user chooses one of the alternatives for the text selection 
from the candidate list, then the word processor replaces the lexl 
selection with the chosen candidate. 

In order to process the request for a candidate list of 
alternatives for a text selection, the stochastic input interface 
139 Iransmits the request to the stochastic Input combiner 131 By 
communicating with the stochastic input sources through the 
stochastic input interface 139, the slochastic input combiner 13Y 
can retrieve information about the slochastic models 270 needed to 
produce the candidate list for the Icxt selection. 

To produce the candidate list, the stochastic input combiner 
137 may optionally consult a natural language model 220. To do so, 
the combiner 137 first produces an interim candidate list of 
alternatives for the text selection using the information retrieved 
from the stochastic models 270. After the combiner 137 provides the 
interim candidate list to the natural language model 220. the 
natural language model analyzes the interim candidate lisi using 
clues such as grammar, the overall meaning of a section of text, and 
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the probability of various word sequences. Based upon this analysis, 
the natural language loodel 220 produces additional alternatives for 
the candidate list and reevaluates the probabilities of those 
alternatives in the interim candidate list. The methods used to 
produce a candidate list of alternatives for a text selection will 
be described with reference to PIGS. 3-9 and 11. 

As shown in FIG. 2. stochastic input sources 230, 240. and 
230 can each provide stochastic data to word processor 210 without 
first filtering their stochastic data through another stochastic 
input source. In other words, stochastic input sources 230, 240. 
and 250 can each directly (through stochastic input interface 139) 
transnil stochastic data to the word processor 210. and stochastic 
data from each source can be incorporated into the same word 
processing docunenl. For this reason, they are "parallel 
stochastic input sources" 296, and these stochastic input sources 
may be described as configured "in parallel." 

Although the various program modules have been described 
separately, one skilled in the art should recognize that the modules 
could be combined in various ways and that new program modules could 
be created to accomplish similar results. In particular, the 
stochastic input combiner 137 and the natural language model 220 
could reside in the stochastic input interface 139. and all three 
program modules could be part of the operating system 135 or the 
word processor 210. Also, the combiner 137 and the natural language 
model 220 could be separate programs that interface with the word 
processor 210 directly. Similarly, the stochastic input sources 230, 
240, 250, and 260 could be stand-alone application program modules 
138. or they could be part of the operating system 135. 
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Graphical Illustration of a Typical Implementation 

PIGS. 3 5 and 11 illoslrate what the user sees and does in a 
typical implementation of the present invention. Furthermore, these 
figures show the fanctionality of the stochastic input combiner 137 
and the natural language model 220. 

In FIG. 3. a conputer 120 with multiple text entry methods 
accepts input from a user and transmits that input to an application 
210, such as a word processor. The computer converts that input 
into a text string 300, which it displays on monitor 147. In this 
example, the user intended to produce the text "THIS IS A MESSAGE 
WRITTEN BY A THOUSAND MONKEYS TYPING AT RANDOM." However, the 
computer interpreted the stochastic input as "TIIIS IS A MHSSAGE 
WRITTEN BY A TOWN OF MY KEYS TAIPING AT RANDOM" to produce text 300. 

Once the text 300 is displayed, the user may make a text 
selection 310 by highlighting a portion of the text. The text 
selection 310 in FIG. 3 has three text components 312, 314. and 316. 
Kach of the text components 312, 314, and 316 comes from a 
different stochastic input source. Thus, for example, text 
component 312 may be one of the alternatives produced by processing 
handwriting input 280 with the handwriting recognition program 
module 230. The alternatives produced by this stochastic input 
source are stored in the stochastic model 270a. The alternatives 
list 318 is a list of the alternatives for text component 312 stored 
in the stochastic model 270b. Furthermore, the computer has chosen 
"TOWN OF" for text component 312 because the computer determined 
that that is the most likely alternative in alternatives list 318 
for the handwriting input 280 that was used to produce the text 
component 312. As shown by alternatives list 318, however, the 
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computer also recognizes thai "GOWN OF" and "THOUSAND" are 
possible alternative phrases for text component 312. 

Similarly, speech recognition program module 240 produces the 
stochastic result stored in stochastic model 270b by processing 
speech input 290. The alternatives list 320 contains the 
alternatives stored in stochastic model 270b for text component 314. 
"MY KBTS" has been selected for text component 314 because it is 
the most likely alternative. 

Likewise, text component 316 "TAIPING" comes from yet a 
third stochastic input source. This stochastic input source stores 
its alternatives in stochastic model 270c, and the alternatives are 
represented in list form in alternatives list 322. 

The stochastic inpot combiner 137 forms various combinations 
of alternatives from alternatives list 318. 320. and 322. The 
stochastic inpul combiner 137 then produces a ranked list of the 
various combinations it has produced based on its calculation of the 
probability that each combination is the one intended by the user 
for text selection 310. The top ranked alternatives for text 
selection 310 are then displayed in alternatives list 330 on the 
monitor 147. 

After the alternatives list 330 for the text selection is 
displayed, the user may choose to edit the text selection, as shovn 
in FIG. 4. In that figure, the user has made edit 410 by typing the 
word "THOUSAND" over the words "TOWN OF" from FIG. 3. As a 
result, text component 312 of text selection 310 is replaced with 
text component 312'. 

Edit 410 may have been accomplished by using a keyboard 140 
and a mouse 142. Because such an entry method is non-stochastic in 
nature, there are no alternatives for text component 312*. This 
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change is reflected in alternatives list 318', which has replaced 
alternatives list 318 fron PIG. 3. The only alternative shown in 
alternatives list 318* Is "THOUSAND." 

After the edit is conpleted, stochastic input combiner 137 
again forms various coisbinations of alternatives from alternatives 
lists 318\ 320. and 322 in order to form alternatives for the 
edited text selection 310. These alternatives are displayed as 
alternatives list 430 on monitor 147. 

If the user desires a different alternative to replace text 
selection 310 than the alternatives displayed in alternatives list 
430. the user may again edit text selection 310. This is shown in 
FIG. II. with the user making edit 412 by typing the word 
"MONKEYS" over the phrase ''MY KETS" that appeared in text 
selection 310 in FIG. 4. As a result, text component 314'. 
"MONKEYS." replaces text component 314. "MY KEYS. " This further 
results in the replacement of alternatives list 320 with 
alternatives list 320\ which contains only one alternative because 
an edit is non -stochastic in nature. Once again, the stochastic 
input combiner. 137 combines alternatives from the various 
alternatives lists 318'. 320'. and 322 in order to produce 
alternatives list 430' for text selection 310. 

At that point, the user may find an alternative 414 which he 
wishes to replace text selection 310. If so. he may highlight and 
choose alternative 414, in which case alternative 414 replaces the 
text selection 310 to produce new text 416. 

PIG. 5 is similar to FIG. 3. except that FIG. 5 illustrates 
an embodiment of the invention in which the natural language model 
220 is operative. As in FIG. 3. a user makes text selection 310 
comprised of text components 312. 314, and 316. The stochastic 
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input combiner 137 then forms various conbinations of alternatives 
for the text components to produce an interim list of alternatives 
for text selection 310. Instead of dispUyins the most probable 
alternatives thus derived, the stochastic input coubiner 137 then 
passes the interim list of alternatives for the text selection 310 
to a natural language model 220. 

The natural language model 220 then re-evaluates the 
probabilities of the various alternatives in the interim 
alternatives list based on natural language principles applied by 
the natural language model to the text selection 310 as a vhole. 
This includes analysis of granoatical and other language clues in 
the text selection. The natural language model 220 may also form 
additional alternatives for the text selection 310 not found in the 
interim list of alternatives provided to the natural language model 
220. The natural language model 220 returns to the stochastic input 
combiner 13Y a revised list of alternatives based on the re- 
evaluated alternatives in the interim alternatives list and the 
additional alternatives it has produced. The stochastic input 
combiner 137 then chooses the top ranked alternatives for display in 
alternatives list 530. 

Because of the operatiun of the natural language model, the 
alternatives list 530 is hopefully hettcr than alternatives list 330 
of FIG. 3. If that is the case, the user may choose an alternative 
1)02 from alternatives list 530 vithoul having lo edit the text 
selection 310. In the example shown in PIG. 5, nevr text 504 is 
formed by the user replacing text selection 310 with alternative b02. 
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Flow Charts for a Typical Implementation 

FIG. 6 is a flow chart of the steps in a typical routine 600 
for processing source data. This routine illustrates the steps that 
implement the embodinenls of the invention described with respect to 
FIGS. 3 5 and IK The routine 600 begins al step 602 with the word 
processor receiYing source data fron multiple input sources. An 
input source may be a stochastic input source, such as handvritins 
recognition program module 230. speech recognition program nodule 
240. or input method editor 250. An input source could also be non- 
stochastic, such as typed data entered using a keyboard 140 and 
mouse 142. Purtbermore. some of the source data may come from two 
or more stochastic input sources configured in series. If data is 
derived from stochastic input sources configured in series, each 
stochastic input source nay be counted as a different input source. 

After the word processor receives source data from multiple 
input sources, the word processor combines that data into a multi- 
source text string in step 604. This means that the word processor 
creates text corresponding to the source data so that the word 
processor can display the text on the monitor 14Y. Furthernoret 
word processor creates a data structure to keep track of the source 
of each word of the text. 

In step 80S, a user may make a text selection including a 
portion of the displayed text. This text selection may include text 
from multiple input sources. The user may, for example, make the 
text selection by depressing a mouse button at the beginning of the 
text selection, dragging the mouse to the end of the desired text 
selection, and then releasing the mouse button. Preferably, the 
word processor highlights the text selection to indicate what has 
been selected. 
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In slep 808, the word processor receives a "display 
alternalives" conuiand for the text ^selection. In response, the 
word processor determines alternatives for the multi-source text 
selection in step 610. 

In step 612. the vord processor displays those alternatives 
on the monitor UV. The word processor preferably displays the 
alternalives in probabilistic order through a graphical user 
interface that allows the user to make a selection from the 
displayed alternatives. The graphical user interface may appear in 
a sub-window that the user can move around so as to reveal text 
hidden by the sub-window. 

In step 614. the user gives the word processor a coninand. 
Kxanples of possible comnands include the selection of a displayed 
alternative, an attenpt to edit the text selection, or an atteinpi to 
make a new text selection by depressing the mouse button to anchor 
the mouse at a point of the text outside of the text selection. 

In step 616. the word processor determines if the user has 
selected a displayed alternative. If the user has selected a 
displayed alternative, then step 618 is performed, in which the word 
processor replaces the text selection with the selected alternative. 
After step 618. the word processor discontinues the display of 
alternatives in step 624 before the routine ends at step 626. After 
step 626. the routine nay be repeated by a return to step 602. 

Returning to step 616, if the user has not selected a 
displayed alternative, then step 620 is performed. In step 620. the 
word processor delcrniaes if the user is editing the text within the 
text selection. If the user is editing Ihe text within the text 
selection, then the word processor processes that edit in step 622. 
After the word processor completes the edit, the routine loops back 
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to step 610 to deterisine nev alternatives for the edited text 
selection. 

Returning to step 6?0, if the user conmand received in step 
614 vas not an edit comiDand within the text selection, then the vord 
processor perforns step 624. In this case, the user has initialed 
the creation of a ncv text selection by depressing the mouse outside 
of the old text selection. Hence, the vord processor discontinues 
the display of alternatives iu step 624 before the routine ends at 
step 626* Once again, the routine may be repeated by returning to 
step 60Z. 

FIG. 7 details the steps of routine 610 from FIG. 6. The 
routine describes the steps for deternining alternatives for a 
multi-source text selection. Typically, a stochastic input combiner 
performs this routine. This combiner nay be a program module in the 
word processor, a separate utility in the operating system, or a 
separate program that interfaces with the word processor. 

The routine begins with step 702, in which the stochastic 
input combiner parses the text selection into text components 
originating from different input sources. To do so. the stochastic 
Input combiner consults the data structure that stores the source of 
each word in the text string displayed on the inonitor 147. Parsing 
the text selection into text components makes the process of 
determining alternatives for the text selection more manageable. 

One skilled in the art should recognize that multiple 
definitions of a text component are possible. Using a different 
definition than the embodiment described in FIGS. (-9 will require a 
different parsing step 702, as well as other appropriate 
modifications of the routines described therein. A text component 
could, for example, be a single word. Or, a text component could be 
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a phrase composed of words originating from Ihe same input source. 
In the latter case, a phrase derived from a stochastic input source 
could be a different text component than an edit inserted into the 
middle of that phrase. 

FIGS. 7 9 illustrate an example in which a text component is 
defined as the largest unit of text derived from a different input 
source or series of stochastic input sources than its neighbors, 
together with any typed text that has edited that text component. A 
unit of typed text that is not an edit of a text component 
originating from a stochastic inpat source is considered its own 
text component. For example, typed text a user has inserted between 
text components originating from different stochastic input sources 
is considered its own text component. 

In step 704, the stochastic input combiner retrieves a 
stochastic model for each text component originating from a 
stochastic input source. In step 706. the stochastic input combiner 
determines if all stochastic models are lattices. If all stochastic 
models are lattices, then step 708 is performed. 

In step V08, the stochastic input combiner concatenates all 
the lattices retrieved into a metalattice. In order to create the 
metalattice. the stochastic input combiner creates nodes to 
incorporate any typed text components for typed text that is not 
incorporated into a text component from a stochastic input source. 
Using lattice traversal methods well known to those skilled in the 
art. the stochastic input combiner traverses the metalattice to form 
a list of alternatives for the text selection in step 710- After 
step 710. control passes to step 718. which will be discussed after 
the "NO" path from step 706 is described 
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Returning to step 706. if all stochastic models retrieved in 
step 704 are not lattices, then the stochastic input conhiaer 
performs step 712. In tbis case, at least une of the stochastic 
models is an "n-best" candidate list. Sa the stochastic input 
combiner converts each of the lattices into an "n-best" candidate 
list. This is necessary for the stochastic input combiner to 
perform step 714. 

In step 714, the stochastic input combiner combines the "n- 
best" candidate list for each of the text components with typed 
text components in order to form a combined list of alternatives for 
the text selection. The stochastic input combiner does this by 
forming all possible combinations of alternatives, one from each 
"n-bcsl" candidate list for a text component. For each 
combination, the stochastic input combiner arranges the alternatives 
for the text components in the order in which the text components 
appear in the text selection. The list of all arrangements thus 
formed comprises the list of alternatives for the text selection. 
After step 714, control passes to step 716. 

Once control passes to step 716 from cither 710 or 714. the 
stochastic input combiner has formed a list of alternatives for the 
text selection. Step 716 is an optional step in which a natural 
language model is operative. If this step is performed, the 
stochastic Input combiner submits the list of alternatives for the 
text selection to the natural language model. 

If optional step 716 is performed, then optional step 718 is 
also performed. In that step, the natural language models returns a 
ranked list of revised alternatives to the stochastic input combiner. 
The revised list includes a re -evaluation of the probabilities of 
the alternatives in the list of alternatives submitted to the 
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natural language model. The revised list may also contain new 
alternatives for the text selection that the natural language model 
forned. The natural language model creates this revised list using 
natural language principles, including an analysis of gramoatical 
and other language clues thai the natural language model applies to 
the ieit selection as a whole. 

In step 720. the stochastic input combiner chooses the top 
ranked alternatives for display. If optional steps 716 and 718 ^ere 
performed, then the stochastic input combiner chooses these 
alternatives from the revised list received from the natural 
language model. If not, then the stochastic input combiner chooses 
these alternatives from the list of alternatives that were created 
by either step 710 or step 714. After step 720, the routine ends 
at step 722. 

FIG. 8 shows the steps of routine 704 on FIG. 7. The routine 
illustrates the steps the stochastic input combiner follows to 
retrieve a stochastic model for each text component of the text 
selection originating from a stochastic input source. The routine 
begins at step 802, which effectively forms a loop for processing 
each text component. In step 802, the stochastic input combiner 
retrieves a text component. In step 804, the stochastic input 
combiner determines if that text component is a stochastic text 
component. If the retrieved component is not a stochastic text 
component, then step 806 is performed. In this case, the text 
component typically comprises typed text entered using a keyboard 
and mouse. Because the text component is non-stochastic, the 
stochastic input combiner assigns the text component a iOOX 
probability. The stochastic input combiner then performs step 818, 
which shall be discussed shortly. 
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Returning to step 804. if the slochaslic input coiabiner 
determines that the text coaponent retrieved in step 802 is 
stochastic, then the stochastic input combiner perfurms step 808. 
In step 808, the stochastic input combiner determines if the text 
conponent is derived from stochastic Dodels configured in series. 
If the text component is derived from a series of stochastic input 
sources, then the stochastic input conbiner performs routine 810 in 
order to derive a scries stochastic model that accurately represents 
the probabilities of the results produced by the last stochastic 
input source of the series. After routine 810, the stochastic input 
combiner performs step 812. Likcvise. If the stochastic input 
conbiner determines in step 808 that the text component retrieved in 
step 802 is not derived from a scries of stochastic models, step 812 
is performed. 

In step 812. the stochastic input combiner determines if the 
user has edited the text component using a lieyboard and mouse. If 
the text conponent has been edited, then the stochastic input 
combiner updates the corresponding stochastic model in step 814. If 
the stochastic model is a lattice, then updating it will include 
deleting any nodes corresponding to words that have been deleted 
fron the text component. Furthermore, the stochastic input combiner 
must add nodes for new words within the text component. Similarly, 
if the stochastic model is an "n-best" candidate list, the 
stochastic input combiner nust update each alternative of the list 
to remove words that have been deleted from the text component and 
add words that have been inserted into the text component. 

After step 814. the stochastic input combiner performs step 
816. The stochastic input combiner also performs step 816 if it 
determines in step 812 that the user has not edited the text 
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coBPoncnt. In step 818, the stochastic input combiner retrieves a 
stochastic model result for the text component that was selected in 
step 802. If the text component was derived from a series of 
stochastic models, then the stochastic model relrieved is the series 
stochastic model produced in step 810 or, if the text component has 
been edited, the series stochastic model that was updated in step 
814. The stochastic model retrieved nay be a lattice or an 'V 
best" candidate list. The stochastic model retrieved need only 
contain information aboal the selected text component, so the 
stochastic input combiner may retrieve the stochastic model from a 
larger stochastic model for a selection of text that includes the 
text component. 

The text component that was selected in step 802 may be 
derived from stochastic input, but a stochastic model representing 
alternatives for that text component may be unavailable. In that 
case, the text component can be treated the same as a non-stochastic 
text component. In other words, the stochastic input combiner 
assigns the known alternative for the text component a probability 
of lOOK. After step 816. the stochastic input combiner performs 
step 818. 

Step 818 can be reached from either step 816 or step 806. In 
this step, the stochastic Input combiner determines if there are any 
more text components in the text selection to process. If there are 
any more text components, then the routine loops to step 802 so the 
stochastic input combiner can get and process the next text 
component. 

When there arc no more text components to process in step 818. 
the stochastic input combiner optionally performs step 820 for 
incorporating the natural language model. In this step, the 
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stochastic input conbiner passes each stochastic model retrieved for 
a text coaponeiit to the natural language Dodel. The natural 
language model applies natural language principles to the text 
cooponents and returns then to the stochastic input combiner. 
Because the natural language model operates on individual text 
components in step 820. instead of on the entire text selection, 
step 820 may be performed either instead of steps Y16 and 718. or 
in addition to those steps. After step 820. the routine ends at 
step 822. 

FIG. 9 illustrates the steps of routine 810 on FIG. 8. This 
routine describes the steps the stochastic input combiner follo7s to 
derive a series stochastic model for a text component produced by 
stochastic input sources configured iu series. 

The routine 810 begins with step 902. vhich effectively 
begins a loop for processing in series order each of the stochastic 
input sources, except the last stochastic input source of the series. 
The first time the stochastic input combiner performs step 902. the 
stochastic input combiner selects the first stochastic input source 
in series order. This is the stochastic input source that receives 
the user input that ultimately i^ioduces the text component. 

Because a stochastic input source produces multiple 
alternative results, the first stochastic Input source produces 
multiple candidates for input into the second stochastic Input 
source in the series. If the stochastic input combiner Is not 
performing step 902 for the first time, then the stochastic input 
combiner will have produced a totalized candidate list in step 914 
(to be described shortly) for the stochastic input source selected 
in step 902. In the latter case, the totalized candidate list 
contains the alternatives associated with the selected stochastic 
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input source that are to be used as input into the subsequent 
stochastic input source of the series. Step 904 effectively begins 
a loop for processing all of the candidates associated vith the 
selected stochastic input source. In step 904. the stochastic input 
combiner retrieves one of the candidates for the selected input 
source. 

In step 906. the stochastic input combiner submits the 
candidate retrieved in step 904 as input into the subseQuent 
stochastic input source in series order. Inputting this candidate 
into the subsequent stochastic input source produces a stochastic 
result because the subsequent source is also stochastic. The 
stochastic input combiner retrieves this stochastic result. 

In step 908. the stochastic input combiner determines if the 
stochastic result retrieved in step 906 is a lattice. If the 
stochastic result retrieved in step 906 is not a lattice, then il is 
a ranked candidate list and step 912 (to be discussed shortly) is 
performed. If the stochastic result retrieved in step 906 is a 
lattice, then the stochastic input combiner must convert the lattice 
into a ranked candidate list of alternatives, with each alternative 
having an associated probability. This is done in step 910 before 
control passes to step 912. 

In step 912, the stochastic input combiner determines if 
there is another candidate for the selected source. If there is 
another candidate for the selected source, then the routine loops 
back to step 904 so thai the stochastic input combiner can get the 
next candidate. If there is not another candidate for the selected 
source, then step 914 is performed. 

In step 914, the stochastic input combiner combines all the 
candidate lists produced by using candidates from the input source 
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selected in step 902 as input into the subsequent stochastic input 
source in series order. This conbination (orns a totalized 
candidate list for the subsequent stochastic input source. The 
stochastic input coiibiner forms the totalized candidate list by 
making one entry for each unique candidate in the candidate lists 
being combined. Then, the stochastic input combiner calculates a 
probability for each alternative in the totalized alternatives list 
by sunning all probabilities assigned to that alternative in each of 
the candidate lists being combined. The stochastic input conbiner 
assigns each calculated probability to its associated alternative. 

In step 916. the stochastic input combiner determines if 
there is another source in the series after what is currently the 
subsequent source. If there is another source in the series, then 
the stochastic input combiner selects vhat is currently the 
subsequent source in step 902, and the next source after what had 
been the subsequent source becomes the subsequenl source At this 
point, the stochastic input combiner chooses candidates from the 
totalized candidate list for the selected input source as input inio 
the subsequent stochastic input source. 

Returning to step 916. if there is not another source in the 
series after the subsequent source, then the routine ends at step 
9L8. The totalized candidate list nost recently created in step 914 
is selected as the series stochastic model. 

FIG. 10 is a logical flow diagram illustrating typical steps 
of an alternative embodiment 1000 of a source data processing 
routine. Generally, the routine provides for automatically 
adjusting the unit of text corrected in response to a user' s text 
selection. 
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Routine 1000 begins vilh step 1006. In that step, the word 
processor 210 receives a text selection from the user which the user 
wants to correct. The user nay specify the text selection by 
selecting the word or group of words conprising the text selection 
with the House H2. Alternatively, (he user may specify a text 
selection consisting of a single word by using the mouse 142 to 
place the insertion point in or adjacent to the word. The word 
processor 210 may then submit the text selection to the stochastic 
input combiner 137 to determine correction alternatives. 

In step 1010, the stochastic input combiner 137 submits the 
text selection to a correction scope model to determine if the scope 
of correction should be adjusted Typically, adjusting the scope of 
correction involves identifying a text unit that will provide belter 
text correction alternatives lo the user than the text selection 
alone. For instance, the text selection may not include neighboring 
words that also contain errors which could be corrected together 
with the text selection. Usually, such errors in words neighboring 
a user' s text selection are identifiable because they relate to 
errors in the text selection. 

Accordingly, a text unit identified by the correction scope 
model may include the text selection plus one or more adjacent words. 
Instead of identifying only a single text unit for possible 
correction, the correction scope model may identify multiple text 
units, each of which are likely to yield good alternatives for text 
correction. 

In step 1015. the stochastic input combiner 137 receives from 
the correction scope model a list of text units for which correction 
alternatives should be provided to the user. If the correction 
scope model determined that the scope of correction need not be 
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adjusted, then the list of text units includes only the text 
selection. If the correction scope iaodei identified only one text 
unit for correction, the list of text units need include only that 
one text unit. Step 1020 effectively begins a loop for 
processing each of the text units identified in the list of text 
units that the correction scope model returned to the stochastic 
input combiuer 137 in step 1015, In step 1020. the combiner 137 
selects a text unit for processing. In step 1025. the combiner 137 
perforas the steps of the routine of FIG. 7 in order to deteminc 
alternatives for the selected text unit. One should understand that 
when the coabiner 137 porforiss the routine 1025 by performing the 
steps described in FIG. 7. "text selection" as used in FIG. 7 
refers to the selected text unit. 

In step 1030, the stochastic input combiner 137 determines if 
there are any more text units to process. If there are more text 
units, the routine loops back to step 1020 along the "YES" branch 
to process the next text unit. If there are no more text units, the 
"NO" branch is followed to step 1035. 

In step 1035. the stochastic input combiner 137 provides each 
of the correction alternatives and tbeir associated text units to 
the vord processor 210 for display. The word processor 210 nay 
display these alternatives in any appropriate manner. If the scope 
of correction was not adjusted, the alternatives may be displayed as 
described with respect to Fia 6. If the combiner 137 expanded the 
scope of correction to a single text unit, the word processor 137 
may highlight the additional words in the text to which the scope of 
correction was expanded in a different color than the color used to 
highlight the text selection, and the word processor may present the 
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alternatives for the text unit in a typical graphical user interface 
as described with respect to FIG. 6. 

Suppose that the correction scope nodel identified multiple 
text units for correction. In that case, the word processor 210 may 
present the user with a nenu of alternatives and identify the 
corresponding text unit for each alternative. 

After the word processor 210 presents the correction 
alternatives to the user through a graphical user interface, the 
routine ends at step 1040. The word processor 210 can then process 
the user' s response to the alternatives as described with respect 
to FIG. 6. 

Conclusion 

Other alternative toibodioients will become apparent to those 
skilled in the art to which an exeaplary embodinient pertains without 
departing from its spirit and scope. Accordingly, the scope of the 
present invention is defined by the appended claims rather than the 
foregoing description. 

4 Brief Description of Drawings 

FI& 1 is a block diagram illustrating the operating 

cnvironnent for an exemplary embodiment of the present invention. 

FIG: 2 is a block diagram providing an overview ol the 
program modules of a multi-source data processing system. 

PIG. 3 is a block diagram that Illustrates the operation of a 
typical embodiment of the present invention. 

FIG. 4 is block diagram that illustrates the operation of an 
embodiment of the present invention that allows the user to edit a 
text selection. 
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FIG. 5 is a block diagram that illBstrates another embodinent 
of the present invention in vhich a natural language node! is 
operative. 

FIG. 6 is a flow chart of the steps in a routine for 
processing source data fron multiple input sources. 

FIG. 7 is a flov chart of the steps in a routine for 
deternining alternatives for a multi-source text selection. 

FI& 8 is a flow chart of the steps in a routine for 
retrieving stochastic models for the text components in a text 
selection. 

FIG. 9 is a flow chart of the steps in a routine for deriving 
a series stochastic model. 

FIG. 10 is a flow chart of the steps in an alternative 
routine for processing source data that includes changing the scope 
of correction of a text selection. 

FIGS. II is block diagram that illustrates the operation uf 
an embodiment of the present invention that allows the user to edit 
a text selection. 
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1 Abstract 

A conpuler-implemented method for providing a candidate list 
of alternatives for a text selection containing text fron multiple 
input sources, each of which can be stochastic (such as a speech 
recognition unit, handwriting recognition unit, or input method 
editor) or non-stochastic (such as a l^evboard and nouse). A text 
componenl of the text selection may be the result of data processed 
through a series of stochastic input sources, such as speech input 
that is converted lo text by a speech recognition unit before being 
used as input into an input oethod editor. To detcralne 
alternatives for the text selection, a stochastic input combiner 
parses the text selection into text components from different input 
sources. For each stochastic text component, the combiner retrieves 
a stochastic model containing alternatives for the text component. 
If the stochastic text cooponent is the result of a series of 
stochastic input sources, the combiner derives a stochastic model 
that accurately reflects the probabilities of the results of the 
entire series. The combiner creates a list of alternatives for the 
text selection by combining the stochastic models retrieved. The 
combiner may revise the list of alternatives by applying natural 
language principles to the text selection as a whole. The list of 
alternatives for the text selection is then presented to the user. 
If the user chooses one of the alternatives, then the word processor 
replaces the text selection with the chosen candidate. 

2 Representative Drawing 

Fig. 1 



